I. INTRODUCTION



In a perfect world the field of economics would not be divided into macroeconomics and microeconomics. The 
former would be derivable from the latter. Our current understanding of economics is reminiscent of the situation in 
statistical physics prior to the 1870s, when the well established field of thermodynamics had to be reconciled with 
the new atomic theory. The work of Boltzmann, Gibbs, Maxwell and others eventually achieved this reconciliation, 
demonstrating that the "macro" theory of thermodynamics is derivable from the "micro" atomic theory. Economists 
are still seeking this kind of unification in their field of study. 

The knowledge that economics is still incomplete has led some economists to take extreme positions. There is 
a school of thought that maintains that no paper on macroeconomics is worth publishing if it is not demonstrably 
grounded on "microfoundations" [1]. At the same time, over the course of the past twenty-five years, there has been 
widespread recognition that the very foundations of neoclassical economics - and microeconomics in particular - 
are deeply flawed. For example, economic agents do not always have perfect information, buyers and sellers do not 
always behave rationally or even in their own best interests, prices are not always set by an auction process, and it 
is sometimes not possible to purchase insurance to cover every eventuality. This has led to a backlash against the
"microfoundations" proponents that is best summarized in the words of Paul Krugman [1], "...the notion that macro
is rotten but micro is in good shape is, well, only half right."

As one might expect, the current situation provides some impetus for transplanting ideas from physics to economics, 
in the hope that the success of the former subject can be replicated in the latter. This was the goal of a now-famous 
meeting at the Santa Fe Institute in 1987 that brought together Nobel laureates in both subjects for this purpose. 
The field of "econophysics" was arguably born at this meeting, and much progress has been made in the years since. 
An outline of the history of the field is described in Beinhocker's book on the subject [2], and its recent developments
are broken down by country in a very informative recent special issue of the journal Science and Culture [3].

An observation made by numerous authors (see, for example, Ref. [4]) is that a useful analogy can be made with
the early work of Boltzmann. When molecules collide, they exchange momentum and energy; when economic agents 
transact, they exchange wealth. If Boltzmann's equation describes the former process, then something similar to a 
Boltzmann equation should describe the latter. This paper pursues this analogy. 

There are, of course, essential differences between molecules and economic agents. For example, in Boltzmann's 
theory of the former, energy is shared amongst the molecules in a Maxwell-Boltzmann distribution. There are 
many hypotheses for the distribution of wealth in societies, and, while some of them involve the Maxwell-Boltzmann 
distribution in various limits, none are really that simple. 

One of the first attempts to quantify the distribution of wealth in a society was made by Vilfredo Pareto in the 
early twentieth century [5]. He studied the distribution of land ownership in Italy by plotting the fraction of people
with wealth greater than x versus x. It is clear from the definition of this curve that it is a non-increasing function
of x. If we suppose that wealth is distributed according to the probability density function (PDF) P(w), so that
$\int_a^b dw\, P(w)$ is the total population with wealth $w \in [a, b]$, then the function that Pareto plotted was

$$A(w) := \frac{1}{N} \int_w^\infty dw'\, P(w'), \tag{1}$$

where $N := \int_0^\infty dw\, P(w)$ is the total population. Differentiating both sides of this relation yields

$$P(w) = -N \frac{dA(w)}{dw}, \tag{2}$$

so the PDF may be easily recovered from Pareto's function.
Pareto found empirically that A(w) was well approximated by

$$A_P(w) \approx \begin{cases} 1 & \text{if } w < w_{\min} \\ \left( w_{\min}/w \right)^{\alpha} & \text{otherwise,} \end{cases} \tag{3}$$

where $w_{\min}$ is a lower bound on wealth, and the exponent $\alpha$ is called the Pareto index. If the total wealth
$W := \int_0^\infty dw\, P(w)\, w$ of the population is to be finite, it must be that $\alpha > 1$. Using Eq. (2), we find the corresponding
Pareto PDF,

$$P_P(w) \approx \begin{cases} 0 & \text{if } w < w_{\min} \\ N \alpha\, w_{\min}^{\alpha} / w^{\alpha+1} & \text{otherwise.} \end{cases} \tag{4}$$

The discontinuity of $P_P(w)$ at $w = w_{\min}$ is worrisome, and most economists regard Pareto's observation as accurate
only in some intermediate range of w.
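
As a rough numerical illustration of Eqs. (1) and (3), the following sketch draws synthetic wealths from a Pareto law by inverse-transform sampling and compares the empirical fraction of agents wealthier than w with $(w_{\min}/w)^{\alpha}$. The function names, the sample size, and the values of $\alpha$ and $w_{\min}$ are assumptions made purely for illustration; they are not taken from any data set discussed here.

```python
import numpy as np

def pareto_function(wealth, w):
    """Empirical A(w): fraction of the population with wealth greater than w."""
    return float(np.mean(np.asarray(wealth, dtype=float) > w))

def sample_pareto(n, alpha, w_min, rng):
    """Inverse-transform sampling: solve (w_min / w)**alpha = 1 - u for w."""
    u = rng.random(n)                       # uniform on [0, 1)
    return w_min * (1.0 - u) ** (-1.0 / alpha)

rng = np.random.default_rng(0)              # fixed seed for reproducibility
alpha, w_min = 1.5, 1.0                     # illustrative values only
wealth = sample_pareto(200_000, alpha, w_min, rng)

for w in (2.0, 4.0, 8.0):
    print(f"w={w}: empirical A={pareto_function(wealth, w):.4f}, "
          f"Pareto law={(w_min / w) ** alpha:.4f}")
```

For samples drawn this way the two columns should agree to within sampling noise, and the empirical curve is non-increasing in w by construction.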
FIG. 1. The Pareto index of the US economy: Actual data for the last century, taken from [6].



Pareto's law is sometimes equated with the "80-20 rule" that asserts that 20% of the population owns 80% of the 
land. In fact, this is implied by Pareto's law for $\alpha \approx 1.16$, but it does not, by itself, imply Pareto's law. More
generally, it is straightforward to show that Pareto's law can be made consistent with the observation that a fraction
f of the population has a fraction 1 - f of the wealth if

$$\alpha = \frac{\ln f}{\ln f - \ln (1 - f)}. \tag{5}$$

Note that the "fair" situation with f = 1/2, in which half of the population owns half of the land, corresponds
to $\alpha \to \infty$; the totally "unfair" situation, in which a vanishingly small fraction of the population owns all but a
vanishingly small fraction of the land, corresponds to $\alpha \to 1$ from above. The Pareto index for the economy of the
United States over the last century [6] is shown in Fig. 1.
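
The relation between the split f and the Pareto index can be checked with a few lines of code. Under the Pareto forms of Eqs. (3) and (4), the richest fraction f of the population holds a fraction $f^{(\alpha-1)/\alpha}$ of the wealth; setting this equal to 1 - f and solving gives $\alpha = \ln f / [\ln f - \ln(1-f)]$. The sketch below simply evaluates that expression; the function name is hypothetical.

```python
import math

def pareto_index_for_split(f):
    """Pareto index alpha for which the richest fraction f owns a fraction 1 - f of the wealth."""
    if not 0.0 < f < 0.5:
        raise ValueError("f must lie strictly between 0 and 1/2")
    return math.log(f) / (math.log(f) - math.log(1.0 - f))

for f in (0.2, 0.1, 0.01):
    print(f"f={f}: alpha={pareto_index_for_split(f):.4f}")
```

The value for f = 0.2 reproduces the "80-20" index of roughly 1.16, and the index approaches 1 from above as f shrinks.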

Although the details of the distribution of wealth in a society are controversial, the appearance of power laws in this 
context is widely accepted. Power laws are often associated with self-similarity, which, in this context, is manifested 
by the following observation: Denote the population with wealth between w/2 and w by $N_-$, and that with wealth
between w and 2w by $N_+$. If Pareto's law holds (footnote 1), then $N_-/N_+ = 2^{\alpha}$, independent of w. That is, the ratio of people
within a factor of two poorer than w to those within a factor of two wealthier than w is independent of w.
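
This ratio follows directly from Pareto's function. A short worked calculation, assuming $w/2 > w_{\min}$ so that the power-law branch of Eq. (3) applies throughout:

```latex
\begin{align*}
N_- &= N\left[A_P(w/2) - A_P(w)\right]
     = N\left(\frac{w_{\min}}{w}\right)^{\alpha}\left(2^{\alpha} - 1\right),\\
N_+ &= N\left[A_P(w) - A_P(2w)\right]
     = N\left(\frac{w_{\min}}{w}\right)^{\alpha}\left(1 - 2^{-\alpha}\right),\\
\frac{N_-}{N_+} &= \frac{2^{\alpha} - 1}{1 - 2^{-\alpha}} = 2^{\alpha}.
\end{align*}
```

The common factor $(w_{\min}/w)^{\alpha}$ cancels, which is why the ratio carries no dependence on w.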

Although Pareto's law has been known for more than a century, its microeconomic foundations are still a subject 
of active research. In the mid-1990s, an innovative class of models, called asset exchange models (AEMs), were 
introduced for this purpose. In this paper, we analyze a particularly interesting one of these, called the "Yard-Sale 
Model" (YSM), originally developed by Chakraborti [7, 8] and his coworkers, analyzed in some detail by Ispolatov,
Krapivsky and Redner [9, 10], and popularized by Hayes [11].

The YSM consists of N economic agents, each endowed with only one quality, namely wealth w. In the simplest 
version of this model, w is a positive real number; that is, we do not allow agents to have negative net wealth. This 
feature is enforced in the initial conditions, and, as will become clear, the dynamics are designed to preserve it. 

The simplest version of the YSM is a closed economic system. The number of agents N remains constant. No 
wealth is imported, exported, generated or consumed, so the total wealth of the population W also remains constant. 
Wealth can only change hands, from one agent to another. Therefore, agents can become wealthier only at the expense 
of other agents becoming poorer. 

Neoclassical economics assumes that all agents are fully informed about their options, and all make decisions based 
on their own financial best interests. If this were really the case (footnote 2), no net wealth would ever change hands. Two
agents might agree to exchange some wealth, but one or the other would refuse to enter into the transaction unless 
the wealth exchanged was equal. Economists refer to this state of affairs as perfect pricing. Under the assumption of 
perfect pricing, the exchange of wealth would leave P(w, t) unaltered. 



1 Here we assume that $w/2 > w_{\min}$ so that we are in the power-law regime.

2 and if there were an absolute notion of value




FIG. 2. The time loop of the basic Yard Sale Model algorithm: As the algorithm proceeds, we keep track of the
distribution of agent wealth versus time.



As described by Hayes [11] and by Beinhocker [2], perfect pricing does not happen in the real world. Real people
make mistakes, and some people are more clever than others about this. It is unrealistic to expect that a person 
wishing to purchase a commodity will conduct an exhaustive search for the lowest price. More often, they will search 
only long enough to find an acceptable price. For these reasons, the wealth exchanged in transactions between agents 
may differ, and net wealth will change hands. The YSM describes the dynamics of this process. 

How much net wealth might be transferred from one agent to another in a given transaction? Let us suppose that 
the amount transferred must be strictly less than the smaller of the wealths of the two agents participating in the 
transaction. This will ensure that all agents maintain positive wealth. In practice, we shall say that the net change 
of wealth is a fraction $\beta \in (0, 1)$ of the wealth of the poorer of the two agents.

Once the net change of wealth has been determined, it remains to decide which agent loses it, and which agent wins 
it. Of course, if one agent is assumed to be more clever than all the others, he/she is more likely to be the winner. 
Such an assumption will have the effect of quickly concentrating wealth in the hands of the most clever agents. To 
give our model economy every benefit of the doubt, therefore, let us assume that the agents are equally clever, so that 
either is equally likely to be the winner.

These considerations lead to the simplest version of the YSM, which is described algorithmically in Fig. 2. Because
AEMs are closed systems in which N and W are conserved, we expect that the distribution P(w,t) will approach a
steady state, dependent only on the values of N and W, as $t \to \infty$. Much of the following discussion is devoted to
studying this limit. 
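
The transaction rule described above can be sketched in a few lines. The following is a minimal illustration only, assuming an initial wealth of one unit per agent, a fixed stake fraction $\beta$, and uniformly random pairing of distinct agents; the function names and parameter values are hypothetical choices, not a prescription from the references.

```python
import numpy as np

def yard_sale_step(wealth, beta, rng):
    """One transaction between two distinct, randomly chosen agents (modifies wealth in place)."""
    n = len(wealth)
    i = rng.integers(n)
    j = rng.integers(n - 1)
    if j >= i:                               # shift to guarantee j != i
        j += 1
    stake = beta * min(wealth[i], wealth[j]) # fraction of the poorer agent's wealth
    if rng.random() < 0.5:                   # fair coin: either agent is equally likely to win
        i, j = j, i
    wealth[i] += stake                       # winner
    wealth[j] -= stake                       # loser stays positive because beta < 1

def simulate(n_agents=200, n_steps=20_000, beta=0.1, seed=0):
    rng = np.random.default_rng(seed)
    wealth = np.ones(n_agents)               # assumed initial condition: equal wealth
    for _ in range(n_steps):
        yard_sale_step(wealth, beta, rng)
    return wealth

wealth = simulate()
print("total wealth:", wealth.sum())
print("share held by the richest 10% of agents:", np.sort(wealth)[-20:].sum() / wealth.sum())
```

Because each step moves the same amount from one agent to another, the total wealth is unchanged up to floating-point rounding, and no agent's wealth can reach zero or become negative.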

In Sec. II we shall make precise many of the concepts introduced above. In particular, we describe the idea, familiar
from kinetic theory, that the PDF of wealth P(w) may be understood as the ensemble average of a corresponding 
quantity in the Klimontovich representation. As in kinetic theory, we may begin with the Klimontovich representation 
to define multi-agent PDFs, and multi-agent correlation functions. 

In Sec. III we consider the kinetics of the YSM, by relating the time rate of change of the one-agent distribution
to an integral over the two-agent distribution for this model. We derive this relation both by considering the outcome 
of a transaction between two agents, and from a master equation approach. We then introduce the random-agent 
approximation, which is the analog of Boltzmann's famous molecular chaos approximation, to derive the analog of the 
Boltzmann equation for the YSM. We demonstrate that this equation conserves N and W for a closed economy. We 
give an exact solution that is non-normalizable, but we present numerical evidence that it is valid between lower and 
upper bounds of wealth. We show that, in the long-time limit, these bounds tend to zero and infinity, respectively, 
as the result tends to a certain generalized function. The appendix contains a short detour through the theory of 
distributions in order to properly describe this generalized function. 

In Sec. IV we study a particularly interesting limit of the Boltzmann equation in which agents are allowed to stake
only a small fraction of their wealth in any one transaction. In this small-transaction limit, the Boltzmann equation
reduces to an elegant partial integrodifferential equation that admits a simple analysis. This equation is, we believe,
new in this context, and one of the principal new results of this paper. We demonstrate that this equation admits 
the same conservation laws as the Boltzmann equation, and we present numerical simulations of its evolution. We 
show that its time-asymptotic limit is the same generalized function described in Sec. III. We conjecture that this
evolution is approximately valid for many more complicated models of economies, such as the famous Sugarscape
model of Epstein and Axtell [12].

Finally, in Sec. V we show how the partial integrodifferential equation derived in Sec. IV can be extended to include
effects such as production, inflation and taxation. We present the dynamical equations with these features included, 
in the small-transaction limit, but we relegate their numerical solution to future work. 



II. DEFINITIONS AND REPRESENTATION 



A. The one-agent density function 

As described in Sec. I, the YSM supposes a population of N agents, each with some wealth $w \in \mathbb{R}_+$. The one-agent
density function is the PDF of agents in wealth space at time t, and is denoted by P(w,t), so that the number of
agents with wealth $w \in [a, b]$ at time t is $\int_a^b dw\, P(w, t)$. If the time variable is clear from the context, we usually omit
it; for example, we might abbreviate P(w,t) by P(w).

The total number of agents is then given by the zeroth moment of P,

$$N(t) = \int_0^\infty dw\, P(w,t), \tag{6}$$

and the total wealth of the agents is the first moment of P,

$$W(t) = \int_0^\infty dw\, P(w,t)\, w. \tag{7}$$

The average wealth of an agent is then W/N. In a closed economy, N and W are conserved quantities, independent 
of time. 






B. Klimontovich representation of one-agent density function 



We consider a population of N agents with individual wealth $w_j(t)$, where $j = 1, \ldots, N$. The Klimontovich
representation of the one-agent PDF is then

$$P_K(w,t) = \sum_{j=1}^{N} \delta\left(w - w_j(t)\right), \tag{8}$$

from which Eqs. (6) and (7) yield $N(t) = N$ and $W(t) = \sum_{j=1}^{N} w_j(t)$, respectively.

The Klimontovich representation retains the individual wealth of each agent in the population as a Dirac delta. 
For most purposes, this is far too much information to be useful. The representation that we would prefer is some 
smoothed version of this. We may smooth $P_K$ by taking an ensemble average over many different populations of N
agents, each evolving independently. These populations are distinct because their initial conditions may differ and 
because their time evolution may be stochastic. 

To represent the ensemble average mathematically, we add a (possibly multidimensional) ensemble label $\sigma$, so
that $w_j(\sigma,t)$ denotes the wealth of the jth agent in the $\sigma$th population of the ensemble at time t. For simplicity,
we insist that each population in the ensemble has the same number of agents N, and the same total wealth
$W = \sum_{j=1}^{N} w_j(\sigma, t)$. We follow common usage in statistical physics, and refer to an ensemble constructed with these constraints
as microcanonical. The Klimontovich representation of the one-agent distribution of population $\sigma$ is then denoted

$$P_K(\sigma,w,t) = \sum_{j=1}^{N} \delta\left(w - w_j(\sigma,t)\right). \tag{9}$$

The ensemble averaged one-agent distribution is then the integral (footnote 3) of this over some measure $d\rho(\sigma)$, normalized so
that $\int d\rho(\sigma) = 1$. That is, the smoothed one-agent PDF that we use is given by

$$P(w,t) = \int d\rho(\sigma)\, P_K(\sigma,w,t) = \int d\rho(\sigma) \sum_{j=1}^{N} \delta\left(w - w_j(\sigma, t)\right). \tag{10}$$

Because our ensemble is microcanonical, Eqs. (6) and (7) still yield $N(t) = N$ and $W(t) = \sum_{j=1}^{N} w_j(\sigma,t)$, respectively,
both quantities being independent of $\sigma$.
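
A numerical stand-in for this ensemble average can be sketched as follows. Since a finite ensemble of Dirac deltas is still singular, the sketch additionally bins wealth into a histogram; that binning is an assumption of the illustration, not part of the definition above. The ensemble itself is hypothetical: random positive wealths rescaled so that every member has the same N and W.

```python
import numpy as np

def smoothed_density(ensemble, edges):
    """Average the binned Klimontovich densities over the ensemble members.

    ensemble: array of shape (members, N) of agent wealths
    edges:    bin edges covering every wealth in the ensemble
    """
    widths = np.diff(edges)
    densities = [np.histogram(w, bins=edges)[0] / widths for w in ensemble]
    return np.mean(densities, axis=0)

rng = np.random.default_rng(1)
members, n_agents = 500, 100
raw = rng.exponential(size=(members, n_agents))
# Rescale each member so that its total wealth equals n_agents (microcanonical constraint).
ensemble = raw * (n_agents / raw.sum(axis=1, keepdims=True))

edges = np.linspace(0.0, ensemble.max() + 1e-9, 61)
P = smoothed_density(ensemble, edges)
centers = 0.5 * (edges[:-1] + edges[1:])
N_est = np.sum(P * np.diff(edges))            # zeroth moment, cf. Eq. (6)
W_est = np.sum(P * centers * np.diff(edges))  # first moment, cf. Eq. (7), up to binning error
print("N estimate:", N_est, " W estimate:", W_est)
```

The zeroth moment is reproduced exactly, while the first moment is recovered only up to an error set by the bin width.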

We note that, in passing from the Klimontovich representation $P_K$ to the smoothed representation P, we have
lost the discrete nature of N and W. In a real economy, agents are individuals (or other legal entities, such as 
corporations), and there is necessarily an integer number of them. Likewise, wealth is measured in some currency, 
and often rounded off to the minimum unit of that currency, or some rational fraction thereof. In the smoothed 
representation, however, N and W are generally real numbers. We will return to this point later in Subsec. III G.



C. Multi-agent density functions 

Similarly, we can define a two-agent density function (footnote 4) at time t, denoted by P(w,w',t). That is, the number of
ordered pairs of agents such that one has wealth between a and b and the other has wealth between c and d at time
t is given by $\int_a^b dw \int_c^d dw'\, P(w,w',t)$. This two-agent PDF satisfies three important properties:

(i) Because the total number of ordered pairs of agents is $N^2$, we must have (footnote 5)

$$N^2 = \int_0^\infty dw \int_0^\infty dw'\, P(w,w',t). \tag{11}$$

(ii) Because the property of being paired is symmetric, we must have

$$P(w,w',t) = P(w',w,t). \tag{12}$$

(iii) Because each agent may be paired with N others, integrating the two-agent PDF over the second variable and
dividing by N must yield the one-agent density function,

$$P(w,t) = \frac{1}{N} \int_0^\infty dw'\, P(w,w',t). \tag{13}$$

To better understand the two-agent PDF, we first consider its Klimontovich representation

$$P_K(w,w',t) = \sum_{j=1}^{N} \sum_{k=1}^{N} \delta\left(w - w_j(t)\right) \delta\left(w' - w_k(t)\right). \tag{14}$$

It is manifest that this factors,

$$P_K(w,w',t) = P_K(w,t)\, P_K(w',t), \tag{15}$$

so that the Klimontovich representation of the two-agent PDF is the product of two one-agent Klimontovich PDFs. 
With this observation, the three properties in the foregoing paragraph follow immediately. 

As with the one-agent PDF, the Klimontovich representation of the two-agent PDF contains much more information 
than we need, so we smooth it by taking an ensemble average, 

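$$P(w,w',t) = \int d\rho(\sigma)\, P_K(\sigma,w,w',t) = \int d\rho(\sigma) \sum_{j=1}^{N} \sum_{k=1}^{N} \delta\left(w - w_j(\sigma, t)\right) \delta\left(w' - w_k(\sigma, t)\right). \tag{16}$$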



3 Note that an average over a finite or countable number of ensemble elements would still yield a singular distribution. To obtain something
smooth, the Dirac deltas of the Klimontovich representation need to be integrated over a continuum. Some authors avoid this problem
by the notational dodge of angle brackets $\langle \cdot \rangle$ for the ensemble average, defined so that $\langle \delta(w - w_j) \rangle$ is somehow smooth. We eschew this
sleight of hand because it evades the real issue: The Klimontovich distribution is a generalized function, so it belongs inside an integral.
The angle brackets must be the integral over some measure, so it is best to denote them as such.

4 It is possible to define a two-agent density function for multiple times as well. For example, we could define the PDF for finding one
agent with wealth $w \in [a, b]$ at time t and another with wealth $w' \in [c, d]$ at time t'. For the purposes of this paper, however, the
single-time version with t' = t is all we need.

5 Note that, because these are ordered pairs, we count the pairing of an agent with wealth w with another with wealth w' as distinct from
the reverse. We also include pairings of agents with themselves. This is why the total number of pairs is $N^2$.

As a consequence of the ensemble average, the smoothed two-agent PDF no longer factors into a product form, but
we can write

$$P(w, w',t) = P(w, t)\, P(w',t) + C(w, w', t), \tag{17}$$

where we have defined the two-agent correlation function

$$C(w,w',t) := \int d\rho(\sigma) \left[ \sum_{j=1}^{N} \delta\left(w - w_j(\sigma,t)\right) - P(w,t) \right] \left[ \sum_{k=1}^{N} \delta\left(w' - w_k(\sigma, t)\right) - P(w', t) \right], \tag{18}$$



which may be thought of as the excess probability of finding a pair of agents, over and above the product of the 
probabilities of finding each individually. As with one-agent PDFs, we sometimes suppress the time dependence, 
writing for example P(w,w') and C(w,w'), instead of P(w,w' ,t) and C(w,w',t), if the time is obvious from the 
context. 

It follows from the definition of the two-agent correlation function that 



$$0 = \int_0^\infty dw\, C(w,w',t) = \int_0^\infty dw'\, C(w,w',t) \tag{19}$$

and 

C(w,w',t) = C(w',w,t), (20) 

and from these one can verify that P(w,w' ,t) still satisfies properties (i) through (iii) above, even though it is no 
longer a product form. 

Likewise, p-agent PDFs for p > 2 can also be expressed as product forms supplemented by connected correlation 
functions. 



III. BOLTZMANN EQUATION FOR DENSITY FUNCTION 



We now consider the problem of deriving a dynamical equation for the one-agent PDF, P(w,t), of the YSM. 
Because agents gain or lose wealth due only to transactions with other agents, we expect that the rate of change of 
the one-agent PDF depends on the two-agent PDF, and indeed this turns out to be the case. We shall derive this 
result both by considering a transaction between a pair of agents, and then again by a master equation approach. 



A. Pair interaction between agents 

The scenario where one agent with wealth w wins and one with wealth w' loses is described by

$$\tilde{w} = w + \alpha \min(w, w') \tag{21}$$
$$\tilde{w}' = w' - \alpha \min(w, w'), \tag{22}$$

where $\tilde{w} > w$ is the new wealth of the winning agent, $\tilde{w}' < w'$ is the new wealth of the losing agent, and $\alpha \in [0, 1)$
is the fraction of the smaller initial wealth that is exchanged in the transaction. Equations (21) and (22) describe a
bijection on $\mathbb{R}_+^2$ with inverse

$$w = \tilde{w} - \frac{\alpha}{1-\alpha} \min\left(\tilde{w}', \frac{1-\alpha}{1+\alpha}\tilde{w}\right) \tag{23}$$
$$w' = \tilde{w}' + \frac{\alpha}{1-\alpha} \min\left(\tilde{w}', \frac{1-\alpha}{1+\alpha}\tilde{w}\right). \tag{24}$$

The Jacobian of this transformation is straightforwardly calculated to be

$$J(\tilde{w},\tilde{w}') = \frac{\partial(w,w')}{\partial(\tilde{w},\tilde{w}')} = \frac{1}{1+\alpha}\, \theta\left(\tilde{w}' - \frac{1-\alpha}{1+\alpha}\tilde{w}\right) + \frac{1}{1-\alpha}\, \theta\left(\frac{1-\alpha}{1+\alpha}\tilde{w} - \tilde{w}'\right), \tag{25}$$

where $\theta$ is the Heaviside function.
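
The forward map, its inverse and the Jacobian can be checked numerically. In the sketch below, `w_new` and `w_prime_new` stand for the post-transaction wealths of the winner and loser; all function and variable names are hypothetical, and the value of $\alpha$ is an arbitrary illustrative choice.

```python
import math

def transact(w, w_prime, alpha):
    """Forward map, Eqs. (21)-(22): the agent holding w wins, the one holding w_prime loses."""
    stake = alpha * min(w, w_prime)
    return w + stake, w_prime - stake

def invert(w_new, w_prime_new, alpha):
    """Inverse map: recover the pre-transaction wealths from the post-transaction ones."""
    ratio = (1.0 - alpha) / (1.0 + alpha)
    stake = alpha / (1.0 - alpha) * min(w_prime_new, ratio * w_new)
    return w_new - stake, w_prime_new + stake

def jacobian(w_new, w_prime_new, alpha):
    """Determinant d(w, w') / d(w_new, w_prime_new) of the inverse map."""
    ratio = (1.0 - alpha) / (1.0 + alpha)
    if w_prime_new > ratio * w_new:      # the winner was the poorer agent before the trade
        return 1.0 / (1.0 + alpha)
    return 1.0 / (1.0 - alpha)           # the winner was the richer (or equal) agent

alpha = 0.25
for w, w_prime in [(1.0, 3.0), (3.0, 1.0)]:
    new = transact(w, w_prime, alpha)
    print((w, w_prime), "->", new, "->", invert(*new, alpha), "J =", jacobian(*new, alpha))
```

The two test pairs exercise both branches of the minimum, so the round trip confirms the inverse in the cases where the winner starts poorer and where the winner starts richer.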
B. Derivation of dynamic equation for density function



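If we suppose that a pair with wealth (w, w') at time t transforms into a pair with wealth $(\tilde{w}, \tilde{w}')$ at time $t + \Delta t$
with probability $\lambda \Delta t$, we must have

$$P(\tilde{w},\tilde{w}',t + \Delta t)\, d\tilde{w}\, d\tilde{w}' = (\lambda \Delta t)\, P(w,w',t)\, dw\, dw' + (1 - \lambda \Delta t)\, P(\tilde{w},\tilde{w}',t)\, d\tilde{w}\, d\tilde{w}' \tag{26}$$

or, employing the Jacobian, Eq. (25),

$$P(\tilde{w},\tilde{w}',t + \Delta t)\, d\tilde{w}\, d\tilde{w}' = (\lambda \Delta t)\, P(w,w',t)\, J(\tilde{w},\tilde{w}')\, d\tilde{w}\, d\tilde{w}' + (1 - \lambda \Delta t)\, P(\tilde{w},\tilde{w}',t)\, d\tilde{w}\, d\tilde{w}'. \tag{27}$$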

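If we cancel $d\tilde{w}$, integrate over $\tilde{w}'$ and divide by N, we obtain

$$P(\tilde{w},t + \Delta t) = \frac{\lambda \Delta t}{N} \int_0^\infty d\tilde{w}'\, P(w,w',t)\, J(\tilde{w},\tilde{w}') + (1 - \lambda \Delta t)\, P(\tilde{w},t), \tag{28}$$

where it is understood that w and w' are functions of $\tilde{w}$ and $\tilde{w}'$ as given by Eqs. (23) and (24). We subtract $P(\tilde{w},t)$
from both sides, divide by $\Delta t$ and let $\Delta t \to 0$ to find

$$\frac{\partial P(\tilde{w},t)}{\partial t} = \frac{1}{N} \int_0^\infty d\tilde{w}'\, P(w, w', t)\, J(\tilde{w}, \tilde{w}') - P(\tilde{w}, t), \tag{29}$$

where we have absorbed $\lambda$ into the time scale. Finally, using Eqs. (23), (24), (25) and (13) and some straightforward
calculation, we find the rate equation (written with the tilde on the new wealth dropped),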



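$$\frac{\partial P(w, t)}{\partial t} = -\left[ P(w,t) - \frac{1}{1+\alpha} P\left(\frac{w}{1+\alpha}, t\right) \right] + \frac{1}{N} \int_0^{\frac{w}{1+\alpha}} dw' \left[ P\left(w - \alpha w', w', t\right) - \frac{1}{1+\alpha} P\left(\frac{w}{1+\alpha}, w', t\right) \right]. \tag{30}$$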



Equation (30) is incomplete because we have not yet taken into account the equal possibility that the agent with
wealth w could lose, and that with wealth w' could win. The rate equation for that case can be derived exactly as
above, but it is easy to see that the result differs from Eq. (30) only by the substitution $\alpha \to -\alpha$. Because agents
win or lose with equal probability, the correct total rate is the average of the two, so the rate equation for the wealth
distribution becomes



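$$\begin{aligned}
\frac{\partial P(w,t)}{\partial t} = {} & -\left[ P(w,t) - \frac{1}{2(1+\alpha)} P\left(\frac{w}{1+\alpha}, t\right) - \frac{1}{2(1-\alpha)} P\left(\frac{w}{1-\alpha}, t\right) \right] \\
& + \frac{1}{2N} \int_0^{\frac{w}{1+\alpha}} dw' \left[ P\left(w - \alpha w', w', t\right) - \frac{1}{1+\alpha} P\left(\frac{w}{1+\alpha}, w', t\right) \right] \\
& + \frac{1}{2N} \int_0^{\frac{w}{1-\alpha}} dw' \left[ P\left(w + \alpha w', w', t\right) - \frac{1}{1-\alpha} P\left(\frac{w}{1-\alpha}, w', t\right) \right].
\end{aligned} \tag{31}$$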



Without this averaging of positive and negative rates, the resulting kinetic equation would not conserve the total wealth
of the population, as we shall demonstrate in Subsec. III F. In Subsec. III C we consider an alternative derivation of
Eq. (31).

We note that Eq. (31) can be written in the form



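$$\frac{\partial P(w,t)}{\partial t} = \int_{-1}^{+1} d\beta\, \eta(\beta) \left\{ -\left[ P(w,t) - \frac{1}{1+\beta} P\left(\frac{w}{1+\beta}, t\right) \right] + \frac{1}{N} \int_0^{\frac{w}{1+\beta}} dw' \left[ P\left(w - \beta w', w', t\right) - \frac{1}{1+\beta} P\left(\frac{w}{1+\beta}, w', t\right) \right] \right\}, \tag{32}$$

where $\eta$ is the PDF of the signed fraction $\beta$ and is given by

$$\eta(\beta) := \frac{1}{2} \delta(\beta - \alpha) + \frac{1}{2} \delta(\beta + \alpha) \tag{33}$$

in the above example. Note that we still regard $\alpha$ as confined to the interval [0,1), but $\beta \in (-1,+1)$. This form
suggests that we could adopt a more general form for $\eta(\beta)$, as long as we retain the normalization $\int_{-1}^{+1} d\beta\, \eta(\beta) = 1$.
For example, by allowing the choice

$$\eta(\beta) = \begin{cases} \frac{1}{2\alpha} & \text{if } |\beta| < \alpha \\ 0 & \text{otherwise,} \end{cases} \tag{34}$$

we model the situation in which the fraction of the poorer agent's wealth that is at stake is uniformly distributed in
$[0, \alpha]$. In any case, we demand that $\eta$ be an even function so that each agent has equal win and loss probabilities in
each interaction.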
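
The two choices of $\eta$ discussed here are easy to sample, which is how they would enter a direct simulation. The sketch below draws the signed fraction $\beta$ from the two-point and uniform forms and checks that both are symmetric about zero; the function names, sample size and value of $\alpha$ are assumptions for illustration.

```python
import numpy as np

def sample_beta_two_point(alpha, size, rng):
    """Two-point choice: beta = +alpha or -alpha, each with probability one half."""
    return alpha * rng.choice([-1.0, 1.0], size=size)

def sample_beta_uniform(alpha, size, rng):
    """Uniform choice: density 1/(2*alpha) on the interval from -alpha to alpha."""
    return rng.uniform(-alpha, alpha, size=size)

rng = np.random.default_rng(2)
alpha = 0.3
b_two = sample_beta_two_point(alpha, 100_000, rng)
b_uni = sample_beta_uniform(alpha, 100_000, rng)
print("two-point mean:", b_two.mean(), " uniform mean:", b_uni.mean())
```

A positive draw corresponds to a win and a negative draw to a loss for the agent in question, so a sample mean near zero reflects the evenness required of $\eta$.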



C. Master equation approach 

As has been pointed out by Ispolatov, Krapivsky and Redner [9], an excellent way to understand the origin of
the terms in equations such as Eq. (31) is to express them in the form of a master equation as follows

$$\begin{aligned}
\frac{\partial P(w,t)}{\partial t} = \frac{1}{N} \int_0^\infty dw' \int_0^\infty dw''\, P(w'', w') \Big[ & -\delta(w'' - w) \\
& + \tfrac{1}{2}\, \theta\left(w - (1+\alpha)w'\right) \delta\left(w'' - w + \alpha w'\right) \\
& + \tfrac{1}{2}\, \theta\left((1+\alpha)w' - w\right) \delta\left(w''(1+\alpha) - w\right) \\
& + \tfrac{1}{2}\, \theta\left(w - (1-\alpha)w'\right) \delta\left(w'' - w - \alpha w'\right) \\
& + \tfrac{1}{2}\, \theta\left((1-\alpha)w' - w\right) \delta\left(w''(1-\alpha) - w\right) \Big].
\end{aligned} \tag{35}$$

We can think of the terms of Eq. (35) as describing an agent with wealth w'' entering into a transaction with another
agent with wealth w'. The Dirac delta on the top line is a loss term; if w'' = w, the transaction results in the loss of
an agent with wealth w. The four succeeding Dirac deltas are source terms, and may be justified as follows:

(i) In the first source term, the agent with wealth w'' > w' wins wealth $\alpha w'$ from the agent with wealth w', and
becomes an agent with wealth $w = w'' + \alpha w' > (1 + \alpha)w'$.

(ii) In the second source term, the agent with wealth w'' < w' wins wealth $\alpha w''$ from the agent with wealth w', and
becomes an agent with wealth $w = (1 + \alpha)w'' < (1 + \alpha)w'$.

(iii) In the third source term, the agent with wealth w'' > w' loses wealth $\alpha w'$ to the agent with wealth w', and
becomes an agent with wealth $w = w'' - \alpha w' > (1 - \alpha)w'$.

(iv) In the fourth source term, the agent with wealth w'' < w' loses wealth $\alpha w''$ to the agent with wealth w', and
becomes an agent with wealth $w = (1 - \alpha)w'' < (1 - \alpha)w'$.

Note that each possibility (i) through (iv) supposes a win or a loss, and so each has a probability of one half.
Performing one or both integrals in each term of Eq. (35) quickly yields Eq. (31).



D. Random-agent approximation and Boltzmann equation 

Equation (31) expresses the rate of change of the one-agent distribution in terms of the two-agent distribution. We
could proceed by writing an equation for the two-agent distribution, but it would involve the three-agent distribution. 
This approach leads to an infinite hierarchy of equations, similar to the BBGKY hierarchy of statistical physics. 

To truncate the hierarchy, we need to make an approximation. Referring to Eq. (17), we see that we can make the
approximation of ignoring the correlation C(w, w' ,t), so that the two-agent PDF is assumed to be a product of two 
one-agent PDFs. In the context of kinetic theory, this is Boltzmann's famous molecular chaos approximation; in this 
context, we refer to it as the random-agent approximation. 

The random-agent approximation assumes that two agents entering a transaction are uncorrelated. It is of questionable
validity. We violate it every time we frequent the same grocery store, instead of choosing one randomly. We
will discuss the shortcomings of the random-agent approximation in Sec. VI. For now we note that its application to
Eq. (32) yields a self-contained dynamical equation for the one-agent PDF,



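$$\frac{\partial P(w,t)}{\partial t} = \int_{-1}^{+1} d\beta\, \eta(\beta) \left\{ -\left[ P(w,t) - \frac{1}{1+\beta} P\left(\frac{w}{1+\beta}, t\right) \right] + \frac{1}{N} \int_0^{\frac{w}{1+\beta}} dw' \left[ P\left(w - \beta w', t\right) - \frac{1}{1+\beta} P\left(\frac{w}{1+\beta}, t\right) \right] P(w',t) \right\}. \tag{36}$$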



Equation (36) is strongly reminiscent of Boltzmann's celebrated kinetic equation of statistical physics. Certainly, the
term with the integral over w' on the right-hand side has the general appearance of an integral collision operator with
quadratic nonlinearity. We pursue this metaphor in Subsec. III E.

E. Comparison with statistical physics 

Boltzmann's kinetic equation of statistical physics is written for the one-particle PDF, f(r,v,t), where r denotes 
position and v denotes velocity, and the evolution equation for this PDF has the form 



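$$\frac{\partial f(\mathbf{r},\mathbf{v},t)}{\partial t} = -\mathbf{v} \cdot \nabla f(\mathbf{r},\mathbf{v},t) + \Omega[f](\mathbf{r},\mathbf{v},t), \tag{37}$$

where $\Omega[f](\mathbf{r}, \mathbf{v}, t)$ denotes a quadratically nonlinear integral collision operator whose detailed form is discussed at
length in standard physics textbooks, and need not concern us here.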

It is interesting to compare the first term on the right of Eq. (37) to that of Eq. (32). To address this, we rewrite
this term in Eq. (37) as a finite difference

$$\mathbf{v} \cdot \nabla f(\mathbf{r},\mathbf{v},t) \approx \frac{1}{\tau} \left[ f(\mathbf{r},\mathbf{v},t) - f(\mathbf{r} - \mathbf{v}\tau,\mathbf{v},t) \right], \tag{38}$$

where $\tau$ is small. We note that both this term and the first term on the right-hand side of Eq. (32) involve the PDF
minus a distortion of itself due to the action of a Lie group. In Boltzmann's kinetic equation, the Lie group is that
of Galilean transformations, $\mathbf{r} \to \mathbf{r} - \mathbf{v}\tau$. In the Boltzmann equation that we have derived for the YSM economy, the
Lie group is that of affine scalings $w \to w/(1 + \beta)$. Just as molecules move in physical space by addition of $-\mathbf{v}\tau$,
agents move in wealth space by multiplication by $1/(1 + \beta)$. Equation (32) may therefore be understood as a variety
of Boltzmann equation that bears the same relation to the affine group as the physical Boltzmann equation bears to
the Galilean group.

This observation strongly suggests that we should investigate the small-$\beta$ limit of Eq. (32) by considering PDFs
$\eta(\beta)$ that have support only in the vicinity of the origin. We shall examine this limit in Sec. IV.



F. Conservation laws



In this subsection, we demonstrate that the quantities N and W, defined in Eqs. (6) and (7), are constants of the
motion of Eq. (36).



1. Conservation of agents 



To demonstrate that the total number of agents, given by Eq. (6), is conserved, we first note that

$$\begin{aligned}
\frac{dN}{dt} &= \frac{d}{dt} \int_0^\infty dw\, P(w,t) = \int_0^\infty dw\, \frac{\partial P(w,t)}{\partial t} \\
&= \int_{-1}^{+1} d\beta\, \eta(\beta) \left\{ -\int_0^\infty dw \left[ P(w,t) - \frac{1}{1+\beta} P\left(\frac{w}{1+\beta}, t\right) \right] \right. \\
&\qquad \left. + \frac{1}{N} \int_0^\infty dw \int_0^{\frac{w}{1+\beta}} dw' \left[ P\left(w - \beta w', t\right) - \frac{1}{1+\beta} P\left(\frac{w}{1+\beta}, t\right) \right] P(w',t) \right\},
\end{aligned} \tag{39}$$

where we have exchanged the order of integration over $\beta$ and w in the second line. It follows that N will be conserved
if the right-hand side vanishes. In fact, we will show that the two terms in the curly brackets vanish separately.
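First, we note that a simple change of integration variable in the first term establishes that

$$\int_0^\infty dw \left[ P(w,t) - \frac{1}{1+\beta} P\left(\frac{w}{1+\beta}, t\right) \right] = 0. \tag{40}$$

Next, we note that

$$\begin{aligned}
& \frac{1}{N} \int_0^\infty dw \int_0^{\frac{w}{1+\beta}} dw' \left[ P\left(w - \beta w', t\right) - \frac{1}{1+\beta} P\left(\frac{w}{1+\beta}, t\right) \right] P(w',t) \\
&\quad = \frac{1}{N} \int_0^\infty dw'\, P(w',t) \int_{(1+\beta)w'}^\infty dw \left[ P\left(w - \beta w', t\right) - \frac{1}{1+\beta} P\left(\frac{w}{1+\beta}, t\right) \right] \\
&\quad = \frac{1}{N} \int_0^\infty dw'\, P(w',t) \left[ \int_{w'}^\infty dw\, P(w, t) - \int_{w'}^\infty dw\, P(w, t) \right] \\
&\quad = 0,
\end{aligned} \tag{41}$$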



where we have changed the order of integration in the first step, and made two different substitutions in the second
step. Combining Eqs. (39), (40) and (41), we find

$$\frac{dN}{dt} = 0, \tag{42}$$

as expected.



2. Conservation of wealth 

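Likewise, to demonstrate that the total wealth of the population, given by Eq. (7), is conserved, we first note that

$$\begin{aligned}
\frac{dW}{dt} &= \frac{d}{dt} \int_0^\infty dw\, w P(w,t) = \int_0^\infty dw\, w\, \frac{\partial P(w,t)}{\partial t} \\
&= \int_{-1}^{+1} d\beta\, \eta(\beta) \left\{ -\int_0^\infty dw\, w \left[ P(w,t) - \frac{1}{1+\beta} P\left(\frac{w}{1+\beta}, t\right) \right] \right. \\
&\qquad \left. + \frac{1}{N} \int_0^\infty dw\, w \int_0^{\frac{w}{1+\beta}} dw' \left[ P\left(w - \beta w', t\right) - \frac{1}{1+\beta} P\left(\frac{w}{1+\beta}, t\right) \right] P(w',t) \right\},
\end{aligned} \tag{43}$$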



where we have exchanged the order of integration over $\beta$ and w in the second line. It follows that W will be conserved
if the right-hand side vanishes. This time, we shall show that the two terms in curly brackets are both odd functions
of $\beta$, so that when they are integrated along with the even function $\eta(\beta)$, the result vanishes.
A simple change of integration variable in the first term establishes that

$$\int_0^\infty dw\, w \left[ P(w,t) - \frac{1}{1+\beta} P\left(\frac{w}{1+\beta}, t\right) \right] = -\beta W, \tag{44}$$



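which is proportional to β. We also have that

$$
\begin{aligned}
\frac{1}{N}\int_0^\infty dw\, w \int_0^{\frac{w}{1+\beta}} dw' \left[ P(w-\beta w',t) - \frac{1}{1+\beta} P\!\left(\frac{w}{1+\beta},t\right) \right] P(w',t)
&= \frac{1}{N}\int_0^\infty dw'\, P(w',t) \int_{(1+\beta)w'}^\infty dw\, w \left[ P(w-\beta w',t) - \frac{1}{1+\beta} P\!\left(\frac{w}{1+\beta},t\right) \right] \\
&= \frac{1}{N}\int_0^\infty dw'\, P(w',t) \left[ \int_{w'}^\infty dw\, (w + \beta w')\, P(w,t) - (1+\beta)\int_{w'}^\infty dw\, w\, P(w,t) \right] \\
&= \frac{\beta}{N}\int_0^\infty dw'\, P(w',t) \int_{w'}^\infty dw\, P(w,t)\,(w' - w),
\end{aligned}
\tag{45}
$$

which is also proportional to β. In Eq. (45), we have changed the order of integration in the first step, and made
two different substitutions in the second step. Combining Eqs. (43), (44) and (45), and invoking the evenness of the
function η(β), we find

$$
\frac{dW}{dt} = 0,
\tag{46}
$$

as expected.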

Note that wealth conservation follows from the average of the rates of change for the winning and losing scenarios,
reflected in the evenness of η(β), as described in the derivation leading to Eq. (36). Wealth is not
conserved by the winning and losing scenarios separately. For example, the foregoing argument should make it clear
that Eq. (30), by itself, does not conserve total wealth.
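As an illustrative numerical check of Eqs. (40) and (44), the sketch below integrates the first bracketed term for a sample density. The density P(w) = N w exp(−w), the values of β, the integration cutoff and the helper names are all assumptions made for this example; they are not taken from the simulations reported in the text.

```python
import math

def trapezoid(f, a, b, n):
    """Composite trapezoid rule for f on [a, b] with n subintervals."""
    h = (b - a) / n
    total = 0.5 * (f(a) + f(b))
    for k in range(1, n):
        total += f(a + k * h)
    return total * h

N_AGENTS = 50_000.0            # illustrative population size
W_TOTAL = 2.0 * N_AGENTS       # first moment of N w exp(-w) is 2N

def P(w):
    """Illustrative density (an assumption): N w exp(-w)."""
    return N_AGENTS * w * math.exp(-w)

def first_bracket(w, beta):
    """P(w) - P(w/(1+beta))/(1+beta), the bracket in Eqs. (40) and (44)."""
    return P(w) - P(w / (1.0 + beta)) / (1.0 + beta)

def moments_of_first_bracket(beta, w_cut=80.0, n=40_000):
    """Zeroth and first moments of the bracket over [0, w_cut]."""
    zeroth = trapezoid(lambda w: first_bracket(w, beta), 0.0, w_cut, n)
    first = trapezoid(lambda w: w * first_bracket(w, beta), 0.0, w_cut, n)
    return zeroth, first

for beta in (0.2, -0.2):
    zeroth, first = moments_of_first_bracket(beta)
    # Eq. (40) predicts zeroth = 0; Eq. (44) predicts first = -beta * W.
    print(f"beta={beta:+.1f}  zeroth={zeroth:.3e}  first={first:.6e}  "
          f"-beta*W={-beta * W_TOTAL:.6e}")
```

The first moments for +β and −β come out equal and opposite, which is the oddness in β that the argument above relies on.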

G. Solutions 

1. Exact solutions 

Ispolatov, Krapivsky and Redner 0, [lj| investigated the Boltzmann equation obtained from applying the random 
agent approximation to Eq. pop, and found that it admitted an exact solution proportional to (wt)^ 1 . In fact, such 
solutions exist for the much more general Eq. (36). Because Eq. (36) is manifestly invariant under time translation,
these solutions can more generally be written as

$$
P(w,t) = \frac{C}{w\,(t+T)},
\tag{47}
$$

where T is an arbitrary constant, which should be positive to avoid a singularity at finite time, and where the constant
C is given by

$$
C = -\frac{N}{\int_{-1}^{+1} d\beta\, \eta(\beta)\, \ln(1+\beta)}.
\tag{48}
$$

The integral in the denominator in Eq. (48) is a constant depending only on the choice of the symmetric function
η(β) used in the model. For example, the choice of Eq. (33) results in

N 

C= — r —^ , (49) 

N 

C = ; 777-TTTTT- ( 50 ) 



and that of Eq. (34) results in



1 + ^-ln 



(1 — q) 1 



2ce 111 ^ (l+a) 1 + c 

At first glance, the existence of such exact solutions might seem very useful. Unfortunately, a solution proportional
to w⁻¹ for all w is not normalizable. It has an infinite number of agents and an infinite total wealth. That is, neither
of the integrals in Eqs. (5) and (7) is finite for these solutions. The constant parameter N that enters Eq. (47) through
C is the same one that appears in Eq. (36), but it no longer has any connection with the number of agents.

In spite of the fact that this solution is non-normalizable, we shall see that it is very useful in understanding the
long-time behavior of solutions for P(w,t).
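To make the constant in Eq. (48) concrete, the following sketch evaluates the integral of η(β) ln(1+β) numerically and compares it with a hand-computed closed form. The kernel used here, uniform on [−a, a], is an assumption chosen for illustration (the specific kernels of Eqs. (33) and (34) are not reproduced in this sketch), and the function names are hypothetical.

```python
import math

def eta_uniform(beta, a):
    """Hypothetical even kernel: uniform on [-a, a] with unit integral."""
    return 1.0 / (2.0 * a) if abs(beta) <= a else 0.0

def log_moment(a, n=20_000):
    """Midpoint-rule estimate of the integral of eta(beta) ln(1+beta) over [-a, a]."""
    h = 2.0 * a / n
    total = 0.0
    for k in range(n):
        beta = -a + (k + 0.5) * h
        total += eta_uniform(beta, a) * math.log1p(beta)
    return total * h

def log_moment_closed_form(a):
    """The same integral evaluated by hand for the uniform kernel."""
    return ((1 + a) * math.log1p(a) - (1 - a) * math.log1p(-a)) / (2 * a) - 1.0

def amplitude_C(N, a):
    """C = -N / integral, i.e. Eq. (48) for the assumed uniform kernel."""
    return -N / log_moment(a)

a = 0.25
print("numerical integral :", log_moment(a))
print("closed-form value  :", log_moment_closed_form(a))
print("C / N              :", amplitude_C(1.0, a))
```

Because η is even, the integral equals half the integral of η(β) ln(1 − β²), which is negative, so C is positive as a density amplitude must be.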

2. Numerical solutions 

We have performed simulations with populations of N = 5 × 10⁴ agents, each given an initial allocation of 100 units
of wealth, so that W = 5 × 10⁶. In these simulations, we took η(β) to be of the form given in Eq. (33), with a = 0.25.
Using infinite-precision arithmetic, we ran the simulation for up to 10⁹ transactions and, following Pareto, we plotted
the fraction of agents with wealth greater than w, namely

$$
A(w,t) := \frac{1}{N}\int_w^\infty dw'\, P(w',t),
\tag{51}
$$

versus w. These results are presented on log-linear plots for various times in Fig. 3, in which three regimes are clearly
visible.

FIG. 3. Log-linear Pareto plots of wealth distribution: Taken from simulation for 50,000 agents, each with an initial
allocation of 100 units of wealth and a = 0.25.

• For sufficiently small values of w, we see A(w,t) ≈ 1. This indicates that P(w,t) goes to zero for small enough
w, so the lower limit of integration in Eq. (51) may be replaced by zero. It makes sense that P(w,t) should
vanish for sufficiently small w. After all, at the beginning of the simulation, all the agents had 100 units of
wealth. Even an agent who lost in every one of his interactions would still have 100(1 − a)^n > 0 units of wealth
remaining after n transactions. That said, it should be noted that the regime in which A(w,t) ≈ 1 is restricted
to extremely small values of w indeed. Remember that it is the logarithm of w that is plotted on the abscissa in
the graphs in Fig. 3. At time t = 10⁸, for example, note that the constant-A regime is confined to ln w < −150,
or w < e^(−150). (This is why we used infinite-precision arithmetic in our calculations.) We refer to this bound as
w_min, so this regime is defined by w < w_min.

• Figure 3 also suggests that A(w,t) ≈ 0 for sufficiently large w. This indicates that P(w,t) also goes to zero for
large enough w. We refer to this bound as w_max, so this regime is defined by w > w_max. Once again, this is
reasonable, this time because there is a bound W on the total wealth of the population. Indeed, it may seem
that w_max must be strictly less than W, but one must be careful about this. It is true in our
simulation because we have discrete agents; as a statement about Eq. (36), however, it is not true, because,
as noted earlier, agent discreteness is lost in this representation, so we might well have "half an agent" with
wealth 2W. We will return to this point in more detail later.

• For intermediate values of w, i.e., w_min < w < w_max, the curves in Fig. 3 fit well to straight lines with negative
slope. In this regime, we evidently have A(w,t) ≈ b(t) − [a(t)/N] ln w, and differentiating both sides with respect to
w, using Eq. (51), yields P(w,t) ≈ a(t)/w. This looks remarkably like the exact solution presented earlier, but it is
truncated for both low and high wealth.
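The measurement described in the three regimes above can be sketched with a small agent-based simulation. The exchange rule coded here is an assumption read off from the structure of Eq. (36): a fraction β of the poorer agent's wealth changes hands, with β drawn from a symmetric (here uniform) distribution on [−a, a]. The population size, transaction count, seed and function names are illustrative and far smaller than the runs reported in the text; ordinary floating point is used instead of infinite-precision arithmetic.

```python
import random

def simulate_ysm(n_agents=500, w0=100.0, a=0.25, n_transactions=20_000, seed=1):
    """Agent-based sketch of a yard-sale-type exchange (assumed rule)."""
    rng = random.Random(seed)
    wealth = [w0] * n_agents
    for _ in range(n_transactions):
        i, j = rng.sample(range(n_agents), 2)      # two distinct agents
        beta = rng.uniform(-a, a)                  # symmetric stake fraction
        poorer, richer = (i, j) if wealth[i] <= wealth[j] else (j, i)
        delta = beta * wealth[poorer]              # stake tied to the poorer agent
        wealth[poorer] += delta                    # poorer wealth scales by (1 + beta)
        wealth[richer] -= delta                    # total wealth is unchanged
    return wealth

def pareto_function(wealth, w):
    """A(w): fraction of agents whose wealth exceeds w, cf. Eq. (51)."""
    return sum(1 for x in wealth if x > w) / len(wealth)

wealth = simulate_ysm()
for w in (1.0, 50.0, 100.0, 200.0, 400.0):
    print(f"A({w:6.1f}) = {pareto_function(wealth, w):.3f}")
```

Plotting `pareto_function` against ln w for increasing transaction counts is the kind of log-linear plot the three regimes refer to; with so few transactions the intermediate straight-line regime is only beginning to form.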

The foregoing discussion suggests that, at any given time t, to a reasonable approximation, P(w,t) has most of its
support only on a finite interval, [w_min(t), w_max(t)]. Thus our numerical results fit well to the approximate solution
P(w,t) ≈ P_c(w,t), where

$$
P_c(w,t) =
\begin{cases}
\dfrac{a(t)}{w} & \text{for } w_{\min}(t) < w < w_{\max}(t) \\[1ex]
0 & \text{otherwise,}
\end{cases}
\tag{52}
$$

from which it follows that

$$
A_c(w,t) :=
\begin{cases}
1 & \text{for } w < w_{\min}(t) \\[1ex]
\dfrac{a(t)}{N}\,\ln\!\left(\dfrac{w_{\max}(t)}{w}\right) & \text{for } w_{\min}(t) < w < w_{\max}(t) \\[1ex]
0 & \text{for } w_{\max}(t) < w,
\end{cases}
\tag{53}
$$



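where the notation reflects the fact that a, w_min and w_max all depend on time t. These quantities cannot all be
independent, however, since they must satisfy

$$
N = \int_0^\infty dw\, P_c(w,t) = a(t)\,\ln\!\left[\frac{w_{\max}(t)}{w_{\min}(t)}\right]
\tag{54}
$$

and

$$
W = \int_0^\infty dw\, P_c(w,t)\, w = a(t)\left[w_{\max}(t) - w_{\min}(t)\right].
\tag{55}
$$

Solving these for w_max(t) and w_min(t), we find

$$
w_{\min} = \frac{W}{2a}\,\operatorname{csch}\!\left(\frac{N}{2a}\right)\exp\!\left(-\frac{N}{2a}\right)
\tag{56}
$$

and

$$
w_{\max} = \frac{W}{2a}\,\operatorname{csch}\!\left(\frac{N}{2a}\right)\exp\!\left(+\frac{N}{2a}\right).
\tag{57}
$$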



Here we have suppressed the explicit dependences on time t, but the point is that the time dependence of a determines
that of w_min and of w_max. This dependence is plotted in Fig. 4, from which it is evident that large values of a correspond
to the egalitarian situation at early times, when everybody has approximately 100 units of wealth. Small values of a
correspond to the situation at later times when there is a broad spectrum of wealth amongst the agents. One might
surmise, therefore, that a(t) decreases in time, and we now turn our attention to measuring the rate at which it does
so.

Given the data in Fig. 3, the easiest quantity for us to measure is w_min(t). We fit the intermediate region of the
curve - the part with negative slope - to a straight line, and determine where it intersects the horizontal line A = 1.
Given w_min(t) calculated in this fashion, we solve Eq. (56) numerically for a(t), and plot 1/a(t) versus t. The result,
shown in Fig. 5, fits remarkably well to the straight line 1/a(t) ≈ 3.93264 + 0.0000204046 t using a least-squares fit.
The slope is close to the value of 1/N = 0.00002. To within a multiplicative constant of order unity, we therefore
conjecture the following approximate form for a(t),

$$
a(t) \approx \frac{N}{T + t},
\tag{58}
$$

where T = N/a(0).
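The two numerical steps used above, evaluating the cutoffs of Eqs. (56) and (57) for a given a, and inverting Eq. (56) to recover a from a measured w_min, can be sketched as follows. The function names and the bisection bracket are assumptions of this example; the upper bracket of 700 simply keeps exp(N/a) within floating-point range, so the sketch only covers a ≥ N/700.

```python
import math

def cutoffs(a, N, W):
    """Return (w_min, w_max) for the truncated a/w profile, cf. Eqs. (56)-(57)."""
    x = N / a
    w_min = (W / a) / math.expm1(x)      # equals (W/2a) csch(N/2a) exp(-N/2a)
    w_max = w_min * math.exp(x)          # equals (W/2a) csch(N/2a) exp(+N/2a)
    return w_min, w_max

def solve_a_from_w_min(w_min, N, W, iterations=200):
    """Invert Eq. (56) for a by bisection on x = N/a.

    Uses the fact that x -> x / expm1(x) decreases monotonically from 1 to 0.
    """
    target = w_min * N / W               # must lie strictly between 0 and 1
    lo, hi = 1e-12, 700.0
    if not (hi / math.expm1(hi) < target < 1.0):
        raise ValueError("w_min is outside the range this sketch can invert")
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if mid / math.expm1(mid) > target:
            lo = mid                     # ratio still too large, so x must grow
        else:
            hi = mid
    return N / (0.5 * (lo + hi))

N, W = 50_000.0, 5.0e6
w_min, w_max = cutoffs(500.0, N, W)
print("w_min =", w_min, " w_max =", w_max)
print("recovered a =", solve_a_from_w_min(w_min, N, W))
```

By construction the returned pair satisfies the constraints of Eqs. (54) and (55), which provides a quick consistency check on any implementation.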

Combining Eqs. (52) and (58), we see that, in the interval [w_min(t), w_max(t)], our fit is very similar to the exact
solution given in Eq. (47). Outside this interval, however, P_c(w) vanishes. We emphasize that P(w,t) = P_c(w,t) is
merely a numerical fit, and it is not a (weak) solution of Eq. (36), as can be verified by direct substitution. It can
also be verified by noting that A_c(w,t) has slope discontinuities at w_min(t) and w_max(t), whereas the numerically
measured A(w,t) in Fig. 3 seems smooth. It is remarkable that this crude truncation of Eq. (47) does as well as it
does in helping us understand the numerical results, but it does not explain them exactly.

FIG. 4. Plot of w_min and w_max versus a: Computed from Eqs. (56) and (57) for N = 50,000 agents, and W/N = 100 units
of wealth. The right-hand side of the plot corresponds to an egalitarian situation where most agents have wealth in the vicinity
of 100 units. As time increases, a decreases, leading to a wide range of wealth in the population, from the very poor to the
very rich.

FIG. 5. Plot of 1/a(t) versus t: Taken from numerical simulation by fitting to determine w_min(t) and solving Eq. (56)
numerically for a(t), as described in text.

Eqs. (52) and (53) differ from the Pareto distribution of Eqs. (4) and (3) in two significant ways. First, there is an
upper bound w_max as well as the lower bound w_min. Second, the effective Pareto index is α = 0 for this model. The
resulting distribution is normalizable only because of the introduction of the upper cutoff w_max.

As mentioned earlier, measured values of the Pareto index are always greater than one, as in Fig. 1, so it should be
reemphasized that this is a very idealized model, and that we are not claiming that it models real economies. More
realistic models can be obtained by adding embellishments to this model, as will be described in Sec. V. To pursue
the metaphor with statistical thermodynamics, this model is the analog of the ideal gas law; no real economy obeys 
it, but it is such a useful idealization that it is worth careful study by anybody who endeavors to understand real 
economies. 



3. The long-time limit 



To what does the solution P(w,t), or its approximation P_c(w,t), converge in the limit of large t or, equivalently,
small a? Because the process is a martingale, there cannot be a stationary solution that is a well behaved function,
but we might expect that P(w,t) and P_c(w,t) converge to the same generalized function or distribution as t → ∞.
In the Appendix, we consider the nature of this generalized function, which we denote by ζ(w), and in what function
space it exists. The reader who is willing to accept at face value the statement, "It converges to something that looks
like a delta function at zero wealth, except that, somehow, it has a positive first moment," may skip the presentation
in the Appendix without fear of losing the overall thread.

IV. A PDE FOR THE YARD SALE MODEL DENSITY FUNCTION 

A. The small-transaction limit 

In some circumstances, it is reasonable to assume that the largest fraction of an agent's wealth that may be lost
in one transaction is small. Most sensible people, after all, do not stake large fractions of their wealth on a single
transaction. In that case, it is reasonable to expand the expression in curly brackets in Eq. (36) in a power series in
β. In doing so, we may note that this expression vanishes when β = 0, so there is no constant term. The next term
of the power series, proportional to β, will contribute nothing when it is integrated alongside the even function η(β).
Hence, the first term that contributes is that of order β². The result, after some work, may be cast in the remarkably
simple form

$$
\frac{\partial P}{\partial t} = \frac{\partial^2}{\partial w^2}\left[\left(\frac{w^2}{2}A + B\right)P\right],
\tag{59}
$$

where we have absorbed the constant factor ∫ dβ η(β) β² into the unit of time t. Here A(w,t) is Pareto's function
defined in Eq. (51), and we have defined

$$
B(w,t) := \frac{1}{N}\int_0^w dw'\, P(w',t)\,\frac{w'^2}{2}.
\tag{60}
$$

Recall that A(w,t) is non-increasing with w, with A(0,t) = 1 and lim_{w→∞} A(w,t) = 0. By contrast, B(w,t) is
non-decreasing with B(0,t) = 0, and lim_{w→∞} B(w,t) not necessarily finite. Both A(w,t) and B(w,t) are functionals
of P, so Eq. (59) is nonlinear.



B. Conservation laws in the small-transaction limit

Before seeking solutions to Eq. (59), we should check that we have retained the conservation laws in the limiting
process. Eq. (59) is clearly in conservation form

$$
\frac{\partial P}{\partial t} + \frac{\partial J_N}{\partial w} = 0,
\tag{61}
$$

where we have defined the flux of agents in wealth space,

$$
J_N := -\frac{\partial}{\partial w}\left[\left(\frac{w^2}{2}A + B\right)P\right]
= -\left(\frac{w^2}{2}A + B\right)\frac{\partial P}{\partial w} - wAP.
\tag{62}
$$

Because J_N vanishes at the boundaries w = 0 and w → ∞, conservation of agents follows immediately by integration
of Eq. (61) over all w.

Note that the quantity μ_N := (w²A/2 + B) P emerges as a kind of chemical potential for agents in wealth space,
because its gradient drives the flux of agents, J_N = −∂μ_N/∂w.
Next note that we may write

$$
0 = w\frac{\partial P}{\partial t} + w\frac{\partial J_N}{\partial w}
= \frac{\partial}{\partial t}(wP) + \frac{\partial}{\partial w}\left(wJ_N + \mu_N\right),
\tag{63}
$$

which is also in conservation form

$$
\frac{\partial}{\partial t}(wP) + \frac{\partial J_W}{\partial w} = 0,
\tag{64}
$$

where we have defined the flux of wealth in wealth space,

$$
J_W = wJ_N + \mu_N = -w\frac{\partial \mu_N}{\partial w} + \mu_N
= -w\left(\frac{w^2}{2}A + B\right)\frac{\partial P}{\partial w} - \left(\frac{w^2}{2}A - B\right)P.
\tag{65}
$$

Because J_W also vanishes at the boundaries w = 0 and w → ∞, conservation of wealth follows immediately by
integration of Eq. (64) over all w.

FIG. 6. Sample PDF and associated fluxes: A sample distribution P(w) (in red, solid), the corresponding agent flux
J_N(w) (in green, dashed), and the corresponding wealth flux J_W(w) (in blue, dot-dashed).

Footnote: We shall use the terms "generalized function" and "distribution" interchangeably.

It is instructive to plot the agent flux and wealth flux as functions of w for a sample distribution. This plot is
shown in Fig. 6 for the arbitrarily chosen distribution P(w) = 50000 w e^(−w), which is normalized to 50,000 agents, and
is plotted as a solid curve in red. The corresponding J_N(w) is plotted as a green dashed curve, and J_W(w) as a blue
dot-dashed curve.

Figure 6 makes evident that there is a threshold for agents in wealth space; the bulk of the agents below this
threshold tend to move downward, while the elite above it tend to move upward. Likewise, there is a different
threshold for wealth; a minority of the wealth below this threshold tends to move downward, while the majority of
wealth above it tends to move upward. The agent threshold is on the tail of the distribution, significantly higher than
the wealth threshold. That is, a small fraction of the agents and a large fraction of the wealth move upward. In this
model, the rich become richer and the poor become poorer.
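The two thresholds can be located explicitly for the sample distribution P(w) = 50000 w exp(−w). In the sketch below, A and the combination w²A/2 + B are evaluated in closed form for this particular P (a hand calculation specific to this example), the fluxes follow Eqs. (62) and (65), and the zero crossings are found by bisection. The function names and the search interval are assumptions of the example.

```python
import math

N_AGENTS = 50_000.0

def P(w):
    """Sample density of Fig. 6."""
    return N_AGENTS * w * math.exp(-w)

def dP(w):
    """Derivative dP/dw of the sample density."""
    return N_AGENTS * (1.0 - w) * math.exp(-w)

def A(w):
    """Pareto's function, Eq. (51), in closed form for this P."""
    return (1.0 + w) * math.exp(-w)

def D(w):
    """w^2 A/2 + B, with B from Eq. (60), in closed form for this P."""
    return 3.0 - (w * w + 3.0 * w + 3.0) * math.exp(-w)

def J_N(w):
    """Agent flux, Eq. (62)."""
    return -D(w) * dP(w) - w * A(w) * P(w)

def J_W(w):
    """Wealth flux, Eq. (65): w J_N + mu_N with mu_N = D P."""
    return w * J_N(w) + D(w) * P(w)

def bisect_root(f, lo, hi, iterations=100):
    """Bisection for a sign change of f on [lo, hi]."""
    f_lo_positive = f(lo) > 0
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if (f(mid) > 0) == f_lo_positive:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

agent_threshold = bisect_root(J_N, 0.5, 10.0)
wealth_threshold = bisect_root(J_W, 0.5, 10.0)
print("agent threshold  w =", agent_threshold)
print("wealth threshold w =", wealth_threshold)
```

Both fluxes are negative (downward) below their threshold and positive (upward) above it, and the agent threshold lies above the wealth threshold, in line with the description of Fig. 6.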



C. Numerical simulations in the small-transaction limit

It is much more straightforward to simulate the PDE in Eq. (59), with A given by Eq. (51) and B given by Eq. (60),
than it is to simulate Eq. (36). We have done this using a finite-difference method for the arbitrarily chosen initial
PDF,

$$
P(w,0) \propto
\begin{cases}
\exp\!\left[-\dfrac{1}{(10-w)(w-4)}\right] & \text{for } 4 < w < 10 \\[1ex]
0 & \text{otherwise,}
\end{cases}
\tag{66}
$$

which has support on [4, 10], and we plot the results in Fig. 7. The results illustrate a fast evolution to a curve
proportional to w⁻¹ in a bounded region, followed by the expansion of that region and concomitant reduction in
magnitude of the curve, presumably approaching the singular function ζ(w) described in the Appendix. At the end
of the Appendix, we show that ζ(w) is a stationary state of Eq. (59) in a weak sense.

FIG. 7. Numerical solution to Eq. (59): A finite-difference method was used to solve Eq. (59) for P(w,t), given the initial
condition in Eq. (66). The result clearly illustrates the approach to a curve proportional to w⁻¹, followed by the eventual
approach to the singular distribution ζ(w).
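A minimal explicit finite-difference sketch of Eq. (59) is given below. It is not the scheme used to produce Fig. 7, whose details are not given in the text; the grid, the time-step rule, the bump-shaped initial profile and all function names are assumptions of this example. The update is written in terms of μ_N = (w²A/2 + B)P, so that the discrete second difference conserves both the number of agents and the total wealth up to negligible boundary terms.

```python
import math

def bump(w):
    """Smooth initial profile supported on (4, 10); an illustrative choice."""
    if 4.0 < w < 10.0:
        return math.exp(-1.0 / ((10.0 - w) * (w - 4.0)))
    return 0.0

def step(P, w, dw, cfl=0.2):
    """One explicit step of dP/dt = d^2/dw^2 [(w^2 A/2 + B) P]; returns (new P, dt)."""
    n = len(P)
    total = sum(P) * dw                      # current number of agents
    A = [0.0] * n                            # fraction of agents above w_i
    running = 0.0
    for i in range(n - 1, -1, -1):
        A[i] = running / total
        running += P[i] * dw
    B = [0.0] * n                            # (1/N) * integral of P w'^2/2 up to w_i
    running = 0.0
    for i in range(n):
        B[i] = running / total
        running += P[i] * 0.5 * w[i] ** 2 * dw
    D = [0.5 * w[i] ** 2 * A[i] + B[i] for i in range(n)]
    mu = [D[i] * P[i] for i in range(n)]     # discrete chemical potential
    dt = cfl * dw * dw / max(D)              # stability restriction of the explicit scheme
    new_P = P[:]
    for i in range(1, n - 1):                # end points are held fixed
        new_P[i] = P[i] + dt * (mu[i + 1] - 2.0 * mu[i] + mu[i - 1]) / (dw * dw)
    return new_P, dt

def run(n_points=121, w_top=30.0, n_steps=200):
    """Evolve the bump profile and return (grid, final P, grid spacing, elapsed time)."""
    dw = w_top / (n_points - 1)
    w = [i * dw for i in range(n_points)]
    P = [bump(x) for x in w]
    t = 0.0
    for _ in range(n_steps):
        P, dt = step(P, w, dw)
        t += dt
    return w, P, dw, t

w, P_final, dw, t_final = run()
print("elapsed time:", t_final)
print("agents :", sum(P_final) * dw)
print("wealth :", sum(x * p for x, p in zip(w, P_final)) * dw)
```

An explicit scheme is used only for brevity; since the diffusion coefficient grows with the second moment of P, longer runs would need either a very small time step or an implicit method.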



D. Discussion 



We have presented a Boltzmann equation for the YSM, and, in the small-transaction limit, we have shown that 
this reduces to a PDE. Both are integrodifferential equations, though the second is easier to understand and simulate 
than the first. Both agent-based numerical results from the Boltzmann equation, and a finite-difference simulation of 
the PDE reveal a strong tendency to drive increasing fractions of wealth into the hands of a decreasing minority of 
agents. In both cases, we conjecture that the time-asymptotic state of the system is a generalized function ζ(w) that
has all of the N agents condensed to zero wealth, while retaining a positive first moment W.

One might wonder if this approach to a singular state indicates that the model is lacking. After all, even idealized 
agent-based models of microeconomics are much more complicated than the YSM. As an example, consider the famous
"Sugarscape" model of Epstein and Axtell [12]. A condensed explanation of Sugarscape may be found in Beinhocker's
book 0, but even this explanation indicates that Sugarscape is vastly more complicated than our simple YSM. 

In Sugarscape, agents have many features other than simply wealth. For example, they have spatial location, and 
they can move about on a two-dimensional grid, searching for "sugar" and "spice," and trading with other nearby 
agents. They also have a built-in algorithm that controls their movements and actions based on their environment. 
In the more sophisticated versions of the model, agents die for lack of sugar and breed when they have excess sugar. 
There are also versions of the model in which the agents can sexually reproduce, with each parent passing along 
features of their algorithm to their offspring.

FIG. 8. Wealth distribution in "Sugarscape": These plots are histograms of the number of agents versus wealth in Epstein
and Axtell's Sugarscape model. Time runs downward from an arbitrary initial distribution in the top figure to something that
looks remarkably like what is observed in the YSM. (Figure taken from Epstein and Axtell [12] with permission.)



Like us, Epstein and Axtell started the agents in Sugarscape with various initial distributions of wealth to see how
these distributions would evolve, and they plotted their results versus time. One of their time sequences is reproduced
in Fig. 8. Time runs downward in this figure. In spite of all the complications present in Sugarscape, the evolution
shown in Fig. 8 is immediately familiar; indeed, the qualitative resemblance to Fig. 7 is striking. A least-squares fit
on a log-log plot reveals that the penultimate plot in Fig. 8 fits well to w^(−1.36), and the last figure fits well to w^(−1.24).
These correspond to Pareto α values of 0.36 and 0.24 - not normalizable unless cutoffs are assumed. These results
are not so far removed from ours.

These observations suggest an Occam's Razor argument that the YSM captures at least some of the essential 
features of Sugarscape, and there is no denying that the YSM is much simpler to understand and simulate. Because 
I suspect that this paper will be read by economists as well as physicists, an additional transcultural cautionary 
word is warranted here. Economists are naturally suspicious of the suggestion that correct macro predictions of a 
theory justify its microfoundations. To them, this conjures up images of Friedman's 1953 arguments defending the 
assumption of perfect rationality in microeconomics simply because it (sometimes) produces correct results. That is 
certainly not what is being suggested here. Sugarscape, while still very idealized, is far more realistic than the YSM. 
In fact, it exhibits emergent phenomena, such as the growth of trade routes, that are not even defined in the YSM. 

To a physicist, the fact that the YSM is able to explain some of the emergent phenomena of Sugarscape, such as 
power-law P with α < 1, can only be regarded as a positive outcome. Physicists have a long history of idealizations that
have advanced human knowledge, from elliptic planetary orbits (Kepler), to arrows on a grid representing magnetic 
domains (Ising). All of these idealizations are known to be unrealistic, and yet all of them have led to leaps in our 
understanding. All we are suggesting here is that the YSM has a key place in the hierarchy of idealizations that 
constitute our understanding of real economic phenomena. 



Footnote: The least-squares fit on the log-log plot was performed discarding histogram entries with zero agents.

V. ADDITIONAL FEATURES
A. The importance of wealth redistribution 

Real economies seem to have Pareto exponents that are greater than one. It is often claimed that α > 1 is necessary
in order for the Pareto PDF to be normalizable. As we have seen, however, this argument is valid only if we assume
no upper cutoff. Real economies have discrete agents, so wealth cannot concentrate beyond the extreme of one agent
having all of it, and this in itself sets an upper cutoff. As Pareto himself observed, there is also usually some social 
safety net for the poor, setting a lower cutoff. With such cutoffs, there is nothing stopping the PDF between them 
from having a Pareto index less than unity, and this is precisely what we have found in both the YSM and Sugarscape 
models described above. 

This naturally raises a question: If normalizability is not the reason that α > 1 is observed in real economies, then
what is the reason? We suggest that real societies have wealth redistribution mechanisms that naturally increase α.
It could be that real societies become politically unstable if α is too small. Whatever the reason, most societies have
taxation on wealth or income, and most governments use the revenues thereby generated to build infrastructure to 
improve the lives of all. 

There are other mechanisms preventing the uncontrolled concentration of wealth. Countries allow immigration to 
increase N, and they mine natural resources (among other things) to increase W. Central banks can print currency. 
Agents may make successful investments outside the country, thereby increasing their own wealth. All of these features 
may impact the distribution of wealth. We consider a few such features in the following subsections. 

Recall that we have studied the YSM at two different levels of description, namely the Boltzmann equation in
Sec. III and the PDE to which it reduces in the small-transaction limit in Sec. IV. We could introduce new features
at either of these two levels of description. In what follows, we continue to use the small-transaction limit because it 
is more elegant and tractable. There is nothing preventing the use of a similar approach for the Boltzmann equation. 

Suppose that a certain mechanism changes the wealth of an agent at a rate f(w) that depends only on that agent's
wealth w. Then, to first order in Δt, we must have

$$
P(w,t)\,dw = P\big(w + f(w)\Delta t,\; t + \Delta t\big)\,dw',
\tag{67}
$$

where w' = w + f(w)Δt is the agent's wealth at time t + Δt, so that dw' = [1 + f'(w)Δt] dw. If we Taylor expand the
right-hand side and retain terms only to first order in Δt, we find

$$
\frac{\partial P}{\partial t} + \frac{\partial}{\partial w}\left(fP\right) = 0.
\tag{68}
$$

Taking the zeroth moment of Eq. (68), we see that it conserves agents. Taking the first moment, we see that Eq. (68)
may not conserve wealth. All of the examples that follow will conserve agents, so we shall use this general approach.

The observations in this section will be restricted to the derivation and exposition of appropriate dynamical
equations. Numerical modeling of economies with these extra features will be reported in a future paper [13].

B. Production 

Suppose that a society produces wealth ξ per unit time, perhaps from an extraction industry of some sort, and that
it divides the wealth thus produced evenly among its N agents. Then f(w) = ξ/N. If this mechanism were the only
one present, the rate equation for the PDF would be

$$
\frac{\partial P}{\partial t} + \frac{\partial}{\partial w}\left[\frac{\xi}{N}P\right] = 0.
\tag{69}
$$

Eq. (69) is a one-sided wave equation with wave speed ξ/N. As noted, it conserves the number of agents N. Taking
the first moment, however, we see that the total wealth of the society satisfies

$$
\frac{dW}{dt} = \xi.
\tag{70}
$$

In this model, therefore, W grows linearly in time.

If we suppose that production occurs in addition to YSM wealth exchange, the full differential equation becomes

$$
\frac{\partial P}{\partial t} + \frac{\partial}{\partial w}\left[\frac{\xi}{N}P\right]
= \frac{\partial^2}{\partial w^2}\left[\left(\frac{w^2}{2}A + B\right)P\right].
\tag{71}
$$

Because we have already demonstrated that the YSM terms on the right conserve both N and W, this combined
model will have constant N and linearly increasing W. Because of the linear increase of W, the model never reaches
a stationary state, but an appropriately scaled version of it may do so.

C. Inflation



Suppose that all agents are able to loan their wealth to external borrowers who pay them an interest ν per unit
time. Then f(w) = νw, so if this mechanism were the only one present, the rate equation for the PDF would be

$$
\frac{\partial P}{\partial t} + \frac{\partial}{\partial w}\left(\nu w P\right) = 0.
\tag{72}
$$

Once again, Eq. (72) conserves agents, but the total wealth of the society obeys

$$
\frac{dW}{dt} = \nu W,
\tag{73}
$$

demonstrating that W grows exponentially in time, with rate constant ν, as expected.

If we suppose that this mechanism is present in addition to YSM wealth exchange, the full differential equation
becomes

$$
\frac{\partial P}{\partial t} + \frac{\partial}{\partial w}\left(\nu w P\right)
= \frac{\partial^2}{\partial w^2}\left[\left(\frac{w^2}{2}A + B\right)P\right].
\tag{74}
$$

Once again, because we have already demonstrated that the YSM terms on the right conserve both N and W, this
combined model has constant N and exponentially increasing W. Again, because of the exponential increase of W,
the model never reaches a stationary state, but an appropriately scaled version of it may do so.



D. Taxation 



Finally, suppose that all agents are assessed a wealth tax at a rate τ per unit time. The amount of tax paid per unit
time by an agent with wealth w is τw. Integrating this over the distribution, we see that the total tax taken from the
society per unit time is τW. If we suppose that this total tax revenue is divided evenly and redistributed amongst the
N agents, we find that f(w) = −τw + τW/N. If this mechanism were the only one present, the rate equation for the PDF
would be

$$
\frac{\partial P}{\partial t} + \frac{\partial}{\partial w}\left[\tau\left(\frac{W}{N} - w\right)P\right] = 0.
\tag{75}
$$

Eq. (75) conserves both N and W. Because it continually redistributes wealth, it is not surprising that it admits the
generalized stationary solution P(w) = Nδ(w − W/N), in a weak sense, as is readily verified. This equitable solution
corresponds to an infinite Pareto index.
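The first-order equation (68) simply transports agents along the characteristics dw/dt = f(w), so the behavior of total wealth under the three mechanisms above can be checked by integrating that ordinary differential equation for a handful of agents. The sketch below uses forward Euler steps; the four-agent population, the parameter values and the function names are illustrative assumptions, and no YSM exchange term is included.

```python
def evolve(wealth, rate, dt, n_steps):
    """Forward-Euler integration of dw/dt = f(w) for every agent."""
    w = list(wealth)
    for _ in range(n_steps):
        total = sum(w)
        n = len(w)
        w = [x + dt * rate(x, total, n) for x in w]
    return w

# Each rate has the signature f(w, W, N); xi, nu and tau are illustrative values.
def production(xi):
    return lambda w, W, N: xi / N            # equal share of produced wealth

def interest(nu):
    return lambda w, W, N: nu * w            # proportional growth

def taxation(tau):
    return lambda w, W, N: tau * (W / N - w) # flat wealth tax, evenly redistributed

initial = [10.0, 50.0, 100.0, 240.0]         # four agents, W = 400

after_production = evolve(initial, production(xi=8.0), dt=0.01, n_steps=1000)
after_interest = evolve(initial, interest(nu=0.05), dt=0.01, n_steps=1000)
after_taxation = evolve(initial, taxation(tau=0.5), dt=0.01, n_steps=1000)

print("production: W =", sum(after_production))
print("interest  : W =", sum(after_interest))
print("taxation  : W =", sum(after_taxation), " wealths =", after_taxation)
```

Under production W grows linearly, under interest it grows approximately exponentially (up to the Euler discretization error), and under taxation W stays fixed while every agent relaxes toward W/N, the delta-function stationary state mentioned above.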

If we suppose that taxation is present in addition to YSM wealth exchange, the full differential equation is

$$
\frac{\partial P}{\partial t} + \frac{\partial}{\partial w}\left[\tau\left(\frac{W}{N} - w\right)P\right]
= \frac{\partial^2}{\partial w^2}\left[\left(\frac{w^2}{2}A + B\right)P\right].
\tag{76}
$$

This combined model will conserve both N and W, and is interesting in that the terms on the left-hand side drive the
Pareto index to infinity, while those on the right-hand side drive it to zero. We might hope that together they would
lead to power-law solutions with intermediate values of the Pareto index, closer to those observed in real economies,
but it is straightforward to verify that a simple power law will not work, even with upper and lower cutoffs. Further
analytic and numerical investigation of this equation will be the subject of future work [13].



VI. CONCLUSIONS 



The analogy between transacting agents and colliding molecules has been pointed out by a number of authors 
(see, e.g., Yakovenko Q). We have pursued this analogy and derived a general Boltzmann equation governing wealth 
distribution in the Yard-Sale Model (YSM), with careful attention to all of the assumptions that must go into such a 
derivation, such as the random-agent approximation. 

We presented strong analytical and numerical evidence that the dynamics of the YSM make the rich richer and the 
poor poorer, inexorably driving the distribution of wealth to a decidedly singular state with vanishing Pareto index. 
The asymptotic state of the dynamics is one in which all but a vanishingly small fraction of the agents have zero 
wealth, even while the first moment of the wealth remains positive. In the Appendix, we introduced the functional
analysis necessary to make this last statement rigorous, describing the asymptotic state as a generalized function ζ
which is different from the Dirac delta at zero wealth.

We then introduced the small-transaction limit in which the Boltzmann equation reduces to a simple partial 
differential equation, and presented numerical evidence that this equation has the same singular limit as the Boltzmann 
equation. To the best of our knowledge, this PDE has not been posited before in the context of wealth dynamics, and 
is therefore one of this paper's principal new contributions. 

We pointed out that other more detailed artificial society models, such as Sugarscape [12], also exhibit dynamics
which drive the Pareto index to values less than unity. We refuted the usual argument proscribing this, based on 
the non-normalizability of the wealth distribution. With lower and upper cutoffs that approach zero and infinity, 
respectively, at just the correct rates, there is nothing preventing a power-law wealth distribution with Pareto index 
less than unity. 

Finally, we showed how this model may be extended to include phenomena which are likely to lead to stationary 
states with more realistic values of the Pareto index. The detailed analytical and numerical examination of these 
models will be the subject of future work [13].

There are many ways in which this work can be expanded and extended. We can add extra variables to the 
agents, such as spatial position. We can examine the development of correlations between transacting agents, and the 
corrections that these make to the random-agent approximation. We can examine the possibility of transactions that 
involve three or more agents at a time, instead of just pairs of agents. We can also examine steady states of Eq. (76).
It is hoped that this presentation will encourage more work along these lines.

Appendix: Description of the time-asymptotic limit

As noted in the text, the function P(w, t) and its approximation P c {w, t) approach a generalized function as t — > oo. 
This generalized function has support only at the origin, and has zeroth moment equal to N. This suggests the limit 
NS(w), but we additionally require that it have first moment W. In the function space L2, this additional requirement 
is impossible to satisfy. We are forced to the conclusion that the dynamics of wealth can evolve P(w, t) to something 
outside L2 in the t — > 00 limit. The appropriate function space in which to study the time asymptotics of wealth is 
therefore a larger function space than Li- This Appendix describes the functional analysis that is necessary to make 
this statement rigorous. The discussion is meant to be self-contained, requiring little prior background in the subject. 
Our numerical simulations clearly indicate that the asymptotic state of the system has $N - 1$ agents in a state of
abject poverty, and one with all the wealth $W$. As noted in the text, however, this division between $N - 1$ poor agents
and one wealthy agent is due to the discrete nature of the simulation. If we could simulate the continuous distribution
of agents governed by Eq. (36), we might expect to see ever smaller "fractions of agents" $f$ with wealth $W/f$, alongside
$N - f$ agents living in poverty. If $f \to 0$ in the time-asymptotic limit, we might expect that everybody eventually
ends up poor, in some sense, so that a good generalized function candidate for $\lim_{t\to\infty} P(w,t)$ or $\lim_{t\to\infty} P_c(w,t)$
might be $N\delta(w)$. Indeed, this view is reinforced by noting that Eq. (36) can be rewritten in the suggestive form



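[Eq. (A.1): the displayed equation is not recoverable from this copy. The legible fragments are the left-hand side
$\partial P(w,t)/\partial t$, an integral $\int d\beta\,\eta(\beta)$, a prefactor $1/N$, an integral over $w'$, the term
$P(w - \beta w', t)$, and the factor $[P(w',t) - N\delta(w')]$.]  (A.1)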



where the notation $-0$ for the lower limit of integration is meant to emphasize that the Dirac delta is entirely contained
within the region of integration. This form makes clear that $P(w,t) = N\delta(w)$ is a steady state solution of Eq. (36).
It is also zero for $w > 0$, consistent with the $a \to 0$ limit of $a/w$.

The trouble with the above view is that, although the generalized function $P(w) = N\delta(w)$ obviously satisfies
the zeroth-moment condition (total number of agents $N$), it does not satisfy the first-moment condition, Eq. (7),
except in the trivial economy that has $W = 0$. This will not do. We need a generalized function $\zeta$ defined for
$w \in [0, \infty)$, that has the following three properties:

(i) $\zeta(w) = 0$ for $w > 0$

(ii) $\int_0^\infty dw\, \zeta(w) = N$

(iii) $\int_0^\infty dw\, \zeta(w)\, w = W$.

The first two of these are reminiscent of the "physicists' definition" of ($N$ times) a Dirac delta. As is well known, the
apparent absurdity of these simultaneous demands was resolved mathematically only by the advent of the theory of
distributions by Sobolev, Schwartz and others between the 1930s and the 1950s [14]. The question facing us now is
how to use distribution theory to define a generalized function $\zeta$ with all three of the above properties.

Distribution theory requires a space $\mathcal{D}$ of test functions $\psi(w)$ that are smooth and have bounded support.⁹
Generalized functions are then associated with linear functionals on this space. A functional $f$ is a map
$\mathcal{D} \to \mathbb{R}$, and the real number that results from its action on a test function $\psi$ is usually denoted
$\langle f, \psi \rangle$. For example, the functional $\delta$ defined by $\langle \delta, \psi \rangle = \psi(0)$ is the Dirac delta.
It is easily seen to be a linear functional, since

$$\langle \delta, c_1\psi_1 + c_2\psi_2 \rangle = (c_1\psi_1 + c_2\psi_2)(0) = c_1\psi_1(0) + c_2\psi_2(0) = c_1\langle \delta, \psi_1 \rangle + c_2\langle \delta, \psi_2 \rangle. \tag{A.2}$$



In this way of thinking, $\delta$ is not a function of $w$; rather, it is a functional on $\mathcal{D}$. We may then revert to writing
$\int dw\, \delta(w)\psi(w)$ in place of $\langle \delta, \psi \rangle$, but it must be understood that this is an abuse of notation. There is never any
question about what the value of $\delta(w)$ is at a particular $w$. Whenever ambiguity arises, we turn to the interpretation
of $\delta$ as a linear functional on $\mathcal{D}$ to resolve it. An excellent introduction to distribution theory may be found in, for
example, the first few chapters of the text by Griffel [14].

To put the generalized function $\zeta$ on a firm footing, we need more requirements on our space of test functions. Let
us first consider the space $\mathcal{G}$ of test functions $\psi$ that are smooth and have bounded support on $[0, \infty)$, and for which

$$F[\psi] := \int_0^\infty dw\, \frac{|\psi(w) - \psi(0)|}{w} < \infty. \tag{A.3}$$

The reader may verify, for example, that the test function

$$\psi(w) = \begin{cases} \exp\left[-\dfrac{1}{w(1-w)}\right] & \text{for } 0 < w < 1 \\ 0 & \text{otherwise} \end{cases} \tag{A.4}$$

belongs to $\mathcal{G}$. By contrast, the functions $\psi(w) = 1$ and $\psi(w) = w$ do not belong to $\mathcal{G}$, because they do not have
bounded support; in the latter case, there is also the problem that $F$ applied to $\psi$ is not finite.
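The finiteness condition on $F$ can be checked numerically. The following is a minimal sketch, not part of the original analysis: it assumes the standard smooth bump function $\exp[-1/(w(1-w))]$ on $(0,1)$ as an illustrative test function, and the helper names `bump` and `F_truncated` are hypothetical. It truncates the integral at a finite cutoff and compares two cutoffs. For the bump function the estimate stops changing once the cutoff exceeds the support, whereas for $\psi(w) = w$ the integrand is identically 1 and the estimate grows with the cutoff.

```python
import math


def bump(w):
    """Illustrative smooth test function supported on (0, 1) (an assumption of this sketch)."""
    if 0.0 < w < 1.0:
        return math.exp(-1.0 / (w * (1.0 - w)))
    return 0.0


def F_truncated(psi, upper, n=200_000):
    """Midpoint-rule estimate of the integral of |psi(w) - psi(0)| / w over (0, upper)."""
    h = upper / n
    psi0 = psi(0.0)
    total = 0.0
    for k in range(n):
        w = (k + 0.5) * h  # midpoints avoid evaluating at w = 0
        total += abs(psi(w) - psi0) / w
    return total * h


# Bounded support: enlarging the cutoff beyond the support changes nothing.
f_bump_2 = F_truncated(bump, 2.0)
f_bump_20 = F_truncated(bump, 20.0)

# psi(w) = w: the integrand is 1, so the estimate equals the cutoff and diverges with it.
f_id_10 = F_truncated(lambda w: w, 10.0)
f_id_100 = F_truncated(lambda w: w, 100.0)

print("bump:", f_bump_2, f_bump_20)
print("identity:", f_id_10, f_id_100)
```

A truncated quadrature cannot prove convergence or divergence; it only illustrates the qualitative difference between the two cases.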



8 With a lower limit of integration of $-0$, as above.

9 A function is smooth if it is infinitely differentiable. A function has bounded support if the set of w for which it is nonzero (more precisely, 
the closure of that set) is a subset of [a, b] for some real a and b. 
We should first verify that $\mathcal{G}$ is indeed a linear space. We do this by supposing that we have two test functions
$\psi_1 \in \mathcal{G}$ and $\psi_2 \in \mathcal{G}$. This means that both $\psi_1$ and $\psi_2$ are smooth and have bounded support on $[0, \infty)$, and that
$F[\psi_j] < \infty$ for $j = 1, 2$. We now consider the linear combination $c_1\psi_1 + c_2\psi_2$. It is clear that this combination is also
smooth and has bounded support on $[0, \infty)$. We then note that the linear combination satisfies Eq. (A.3), since

$$\begin{aligned}
F[c_1\psi_1 + c_2\psi_2] &= \int_0^\infty dw\, \frac{|c_1\psi_1(w) + c_2\psi_2(w) - c_1\psi_1(0) - c_2\psi_2(0)|}{w} \\
&\le \int_0^\infty dw\, \frac{|c_1\psi_1(w) - c_1\psi_1(0)| + |c_2\psi_2(w) - c_2\psi_2(0)|}{w} \\
&\le |c_1| F[\psi_1] + |c_2| F[\psi_2] \\
&< \infty,
\end{aligned} \tag{A.5}$$

where we have used the triangle inequality. So $\mathcal{G}$ is closed under linear combinations, and thereby qualifies as a linear
space.

In fact, $\mathcal{G}$ is not quite big enough for our purposes. We want the functions $\phi(w) = 1$ and $\phi(w) = w$ and constant
multiples thereof to be in our space of test functions, but, as noted above, they are not in $\mathcal{G}$. So we next define $\mathcal{G}_1$ to
be the space of functions that are the sum of a function in $\mathcal{G}$ and any linear function of $w$. That is, for each $\phi \in \mathcal{G}_1$, we
may write $\phi(w) = \psi(w) + \gamma + \mu w$, where $\psi \in \mathcal{G}$ and $\gamma, \mu \in \mathbb{R}$. Moreover, we shall demonstrate that this decomposition
is unique: for any function $\phi \in \mathcal{G}_1$ there are unique real numbers $\gamma$ and $\mu$ such that $\psi(w) = \phi(w) - \gamma - \mu w \in \mathcal{G}$.

Before showing how to compute $\gamma$ and $\mu$, we should make an incidental comment: The principal reason for using
test functions with bounded support in distribution theory is to allow us to integrate by parts, discarding surface
terms with reckless abandon. Note that we can do this in the space $\mathcal{G}$, but we will need to be a bit more careful in
the space $\mathcal{G}_1$ because $\lim_{w\to\infty} \phi'(w) = \mu$.

To calculate $\gamma$ and $\mu$ from $\phi \in \mathcal{G}_1$, note that $\lim_{w\to\infty} (\phi(w) - \mu w) = \gamma$ follows from the fact that $\psi$ has bounded
support. The approach is then to show that $\mu$ is the unique real number for which the limit $\lim_{w\to\infty} (\phi(w) - \mu w)$
exists, and that for this value of $\mu$ the value of the limit is $\gamma$.

To see that this approach defines $\mu$ uniquely, let us suppose that there were two values $\mu_1$ and $\mu_2$ for which the
limit existed. That is, suppose that

$$\lim_{w\to\infty} (\phi(w) - \mu_1 w) = \gamma_1 \tag{A.6}$$

$$\lim_{w\to\infty} (\phi(w) - \mu_2 w) = \gamma_2 \tag{A.7}$$

are both finite and real. Since both limits exist, we can subtract these equations to obtain

$$\lim_{w\to\infty} [(\mu_2 - \mu_1) w] = \gamma_1 - \gamma_2, \tag{A.8}$$

but there is no way that this last statement can be true, unless $\mu_1 = \mu_2$. Uniqueness of $\gamma$ then follows immediately.

The unique association of $\phi \in \mathcal{G}_1$ with the constant $\mu$, such that $\lim_{w\to\infty} (\phi(w) - \mu w)$ exists, is itself a linear
functional, which we shall call $\Xi$; that is, we write $\langle \Xi, \phi \rangle = \mu$. To demonstrate linearity of $\Xi$, let us suppose that
$\phi_j \in \mathcal{G}_1$, so that $\lim_{w\to\infty} (\phi_j(w) - \mu_j w) = \gamma_j$ exists, and we can write $\langle \Xi, \phi_j \rangle = \mu_j$ for $j = 1, 2$. By taking a linear
combination of these limits, it follows that

$$\lim_{w\to\infty} [(c_1\phi_1(w) + c_2\phi_2(w)) - (c_1\mu_1 + c_2\mu_2) w] = c_1\gamma_1 + c_2\gamma_2 \tag{A.9}$$

also exists, so

$$\langle \Xi, c_1\phi_1 + c_2\phi_2 \rangle = c_1\mu_1 + c_2\mu_2 = c_1\langle \Xi, \phi_1 \rangle + c_2\langle \Xi, \phi_2 \rangle, \tag{A.10}$$

thereby demonstrating linearity of the functional $\Xi$ and justifying our notation.
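The asymptotic-slope functional described above can be sketched numerically. This is only an illustration under stated assumptions: the helper names `bump` and `slope_and_offset` are hypothetical, and the sketch assumes that a bound `w_far` on the support of the bounded-support part of the test function is known in advance. Beyond that bound the test function is exactly linear, so the slope and the offset can be read off from two sample points.

```python
import math


def bump(w):
    """Illustrative smooth function supported on (0, 1) (an assumption of this sketch)."""
    if 0.0 < w < 1.0:
        return math.exp(-1.0 / (w * (1.0 - w)))
    return 0.0


def slope_and_offset(phi, w_far=10.0):
    """Return (mu, gamma) with phi(w) - mu*w -> gamma as w grows.

    Assumes the bounded-support part of phi vanishes for w >= w_far,
    so phi is exactly gamma + mu*w there.
    """
    w1, w2 = w_far, 2.0 * w_far
    mu = (phi(w2) - phi(w1)) / (w2 - w1)
    gamma = phi(w1) - mu * w1
    return mu, gamma


# Two illustrative test functions of the form psi + gamma + mu*w.
phi1 = lambda w: bump(w) + 3.0 + 2.0 * w
phi2 = lambda w: 5.0 * bump(w) - 1.0 + 0.5 * w

c1, c2 = 4.0, -2.0
combo = lambda w: c1 * phi1(w) + c2 * phi2(w)

mu1, gamma1 = slope_and_offset(phi1)
mu2, gamma2 = slope_and_offset(phi2)
mu_c, gamma_c = slope_and_offset(combo)

# Linearity: the slope of the combination is the combination of the slopes.
print(mu_c, c1 * mu1 + c2 * mu2)
print(gamma_c, c1 * gamma1 + c2 * gamma2)
```

The two-point evaluation is a convenience of the sketch; the definition in the text is the limit itself, which does not require knowing the support.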

Armed with our space $\mathcal{G}_1$ of test functions and the functional $\Xi$, we are now ready to make sense of the generalized
function described earlier. In the language of distributions, $\zeta$ may be written

$$\zeta = N\delta + W\Xi. \tag{A.11}$$

That is, for any test function $\phi \in \mathcal{G}_1$, where $\phi(w) = \psi(w) + \gamma + \mu w$ with $\psi \in \mathcal{G}$, we have

$$\langle \zeta, \phi \rangle = N\phi(0) + W\mu. \tag{A.12}$$
As with $\delta$, we may now abuse notation by writing the above as follows

$$\int_0^\infty dw\, \zeta(w)\phi(w) = N\phi(0) + W\mu. \tag{A.13}$$

Setting $\phi(w) = 1$, we find $\gamma = 1$ and $\mu = 0$, so it follows that

$$\int_0^\infty dw\, \zeta(w) = N. \tag{A.14}$$

Setting $\phi(w) = w$, we find $\gamma = 0$ and $\mu = 1$, so it follows that

$$\int_0^\infty dw\, \zeta(w)\, w = W. \tag{A.15}$$

Thus, the generalized function $\zeta$ has zeroth moment $N$ and first moment $W$, as required by properties (ii) and (iii) above.
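The heuristic picture from the start of this Appendix, in which a vanishing "fraction of agents" holds wealth $W/f$ each while the rest sit at $w = 0$, can be compared with the rule $N\phi(0) + W\mu$. The sketch below is illustrative only: the values of `N`, `W`, `GAMMA` and `MU`, the half-bump `psi`, and the function names are all assumptions made for the example, not quantities from the paper. For this two-point distribution the difference from the rule works out to $f\,|\psi(W/f) - \psi(0)|$, which shrinks linearly as the fraction tends to zero.

```python
import math

N, W = 100.0, 50.0      # illustrative agent number and total wealth (assumed values)
GAMMA, MU = 3.0, 2.0    # illustrative offset and asymptotic slope of the test function


def psi(w):
    """Half-bump: smooth on [0, inf), supported on [0, 1), with psi(0) = exp(-1).

    psi(w) - psi(0) is O(w^2) near w = 0, so |psi(w) - psi(0)| / w is integrable.
    """
    if 0.0 <= w < 1.0:
        return math.exp(-1.0 / (1.0 - w * w))
    return 0.0


def phi(w):
    """Test function of the form psi + gamma + mu*w."""
    return psi(w) + GAMMA + MU * w


def zeta_action(phi_at_zero, mu):
    """The rule N*phi(0) + W*mu for the action of the generalized function."""
    return N * phi_at_zero + W * mu


def two_point_action(frac):
    """Action on phi of (N - frac) agents at w = 0 plus frac agents at w = W/frac."""
    return (N - frac) * phi(0.0) + frac * phi(W / frac)


target = zeta_action(phi(0.0), MU)
fractions = [1.0, 0.1, 0.01, 0.001]
gaps = [abs(two_point_action(frac) - target) for frac in fractions]

# Moment checks: phi = 1 has (phi(0), mu) = (1, 0); phi = w has (0, 1).
zeroth_moment = zeta_action(1.0, 0.0)
first_moment = zeta_action(0.0, 1.0)

for frac, gap in zip(fractions, gaps):
    print(frac, gap)
```

The two-point distribution already has zeroth moment $N$ and first moment $W$ for every positive fraction, which is why only the bounded-support part of the test function contributes to the gap.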

Note that $\mathcal{G}_1$ may be characterized as the space of functions whose second derivative is in $\mathcal{G}$. (If we have $\phi(w) =
\psi(w) + \gamma + \mu w$, then clearly $\phi(w)$ and $\psi(w)$ have the same second derivative.) This observation relates $\mathcal{G}_1$ to a class of
function spaces known as Sobolev spaces, but elaboration of this point would take us beyond the scope of this paper.

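Can we now prove that the function $P_c$, defined in Eq. (52), converges weakly to $\zeta$ in the limit as $a \to 0$? For an
arbitrary $\phi \in \mathcal{G}_1$, and for $\mu = \langle \Xi, \phi \rangle$ and $\psi(w) = \phi(w) - \gamma - \mu w \in \mathcal{G}$, we consider the quantity

$$\begin{aligned}
|\langle P_c - \zeta, \phi \rangle| &= \left| \int_0^\infty dw\, [P_c(w) - \zeta(w)]\, \phi(w) \right| \\
&= \left| \int_0^\infty dw\, P_c(w) [\psi(w) + \gamma + \mu w] - N\phi(0) - W\mu \right| \\
&= \left| \int_0^\infty dw\, P_c(w) [\psi(w) - \psi(0)] \right| \\
&\le |a| \int_0^\infty dw\, \frac{|\psi(w) - \psi(0)|}{w} \\
&\le M|a|,
\end{aligned} \tag{A.16}$$

where $M = F[\psi] < \infty$ because $\psi \in \mathcal{G}$, and where we used the fact that $P_c$ was constructed to have zeroth moment $N$
and first moment $W$, together with $\phi(0) = \psi(0) + \gamma$. It follows that

$$\lim_{a \to 0} |\langle P_c - \zeta, \phi \rangle| = 0. \tag{A.17}$$

Because Eq. (A.17) holds for arbitrary test functions $\phi \in \mathcal{G}_1$, we can conclude that $P_c$ converges weakly to $\zeta$ in the
limit as $a \to 0$ or $t \to \infty$ in the function space $\mathcal{G}_1$. Our numerical evidence then strongly suggests that $P$ obeying
Eq. (36) likewise converges weakly to $\zeta$. This last point is, of course, not proven by the above arguments, but we offer
it as a very plausible conjecture.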

Finally, we can show that the generalized function $\zeta(w)$ described above is also a weak stationary state of the
dynamical equation for the small-transaction limit, Eq. (59). To see this, we examine the integral of the right-hand
side of Eq. (59) multiplied by an arbitrary test function $\phi \in \mathcal{G}_1$,

$$\int_0^\infty \phi(w)\, \frac{\partial^2}{\partial w^2} \left[ \left( \frac{w^2}{2} A + B \right) \zeta \right] dw. \tag{A.18}$$

Writing $\phi(w) = \psi(w) + \gamma + \mu w$ as before, and integrating by parts twice, we find

$$\int_0^\infty \frac{d^2\phi(w)}{dw^2} \left( \frac{w^2}{2} A + B \right) \zeta\, dw. \tag{A.19}$$

To evaluate this last integral, let $f(w) := \psi''(w)(w^2 A/2 + B)$. Then the integral is equal to $N f(0) + W\mu$, where $\mu$ is
the unique number such that $\lim_{w\to\infty} [f(w) - \mu w]$ exists. We first note that $f(0) = 0$ because $w^2 A/2 + B$ vanishes
at $w = 0$ (and $\psi$ is smooth). Then $\mu = 0$ follows from the fact that $\psi$ has bounded support. So the integral vanishes,
and the generalized function $\zeta(w)$ is a stationary state of Eq. (59) in this weak sense. We conjecture that it is the
stationary state to which arbitrary initial conditions generically attract.



